Data
## **Meta-Album Insects2 Dataset (Micro)** The pest insects dataset was originally created as a large scale benchmark dataset for Insect Pest Recognition (https://github.com/xpwu95/IP102). It contains…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album MPII Human Pose Dataset (Micro)** The MPII Human Pose dataset (http://human-pose.mpi-inf.mpg.de/#download) is a state-of-the-art benchmark for evaluation of articulated human…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Plant Doc Dataset (Micro)** The PlantDoc dataset (https://github.com/pratikkayal/PlantDoc-Dataset) is made up of images of leaves of healthy and unhealthy plants. The images were…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Textures-ALOT Dataset (Micro)** Textures ALOT dataset (https://aloi.science.uva.nl/public_alot/) consists of 27 500 images from 250 categories. The images in the dataset are captured…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Animals with Attributes Dataset (Micro)** The original Animals with Attributes 2 (AWA) dataset (https://cvml.ist.ac.at/AwA2/) was designed to benchmark transfer-learning algorithms, in…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album RSD Dataset (Micro)** RSD46 dataset (https://github.com/RSIA-LIESMARS-WHU/RSD46-WHU) is collected from Google Earth and Tianditu. The collection contains 46 scene categories, with a…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Subcellular Human Protein Dataset (Micro)** This dataset is a subset of the Subcellular dataset in the Protein Atlas project (https://www.proteinatlas.org/). The original dataset, which…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album Boats Dataset (Micro)** The original version of the Meta-Album boats dataset is called MARVEL dataset (https://github.com/avaapm/marveldataset2016). It has more than 138 000 images of…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
800 instances - 3 features - 20 classes - 800 missing values
## **Meta-Album DIBaS Dataset (Mini)** The Digital Images of Bacteria Species dataset (DIBaS) (https://github.com/gallardorafael/DIBaS-Dataset) is a dataset of 33 bacterial species with around 20…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
1320 instances - 3 features - 33 classes - 1320 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
118 runs - 0 likes - 0 downloads - 0 reach - 0 impact
195 instances - 11 features - 2 classes - 2 missing values
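The binarized entries in this listing describe computing the mean of a numeric target and mapping each instance to one of two nominal classes. A minimal sketch of that idea follows. The function name `binarize_by_mean`, the label names, and the rule that values below the mean receive the first label are all assumptions made for illustration, because the description is cut off before it states them.

```python
from statistics import mean


def binarize_by_mean(targets, below_label="P", other_label="N"):
    """Map numeric targets to two nominal labels, split at the mean.

    Assumption: values strictly below the mean get `below_label`;
    all other values get `other_label`. The label names are hypothetical.
    """
    threshold = mean(targets)
    return [below_label if t < threshold else other_label for t in targets]


# Example: the mean of these values is 4.0, so only 10.0 is not below it.
labels = binarize_by_mean([1.0, 2.0, 3.0, 10.0])
print(labels)
```

Swapping the two label arguments gives the opposite convention if the dataset uses it.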
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
746 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1024 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
572 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100 instances - 3 features - 2 classes - 0 missing values
1. Title: Dermatology Database 2. Source Information: (a) Original owners: -- 1. Nilsel Ilter, M.D., Ph.D., Gazi University, School of Medicine 06510 Ankara, Turkey Phone: +90 (312) 214 1080 -- 2. H.…
1756 runs - 0 likes - 0 downloads - 0 reach - 0 impact
366 instances - 35 features - 6 classes - 8 missing values
A simple database containing 17 Boolean-valued attributes describing animals. The "type" attribute appears to be the class attribute. Notes: * I find it unusual that there are 2 instances of "frog"…
191 runs - 0 likes - 0 downloads - 0 reach - 0 impact
101 instances - 17 features - 7 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
766 runs - 0 likes - 0 downloads - 0 reach - 0 impact
55 instances - 3 features - 2 classes - 0 missing values
1. Title: Echocardiogram Data 2. Source Information: -- Donor: Steven Salzberg (salzberg@cs.jhu.edu) -- Collector: -- Dr. Evlin Kinney -- The Reed Institute -- P.O. Box 402603 -- Miami, FL 33140-0603…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
132 instances - 8 features - 4 classes - 103 missing values
Context So it's Halloween again dear Kagglers! And what better way of celebrating than with some NLP! The dataset brings you the reviews of popular Halloween costumes sold on amazon as of November…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7814 instances - 5 features - classes - 16 missing values
Context This dataset is a small snap (sample) out of ocean-depth entries in the original dataset, which keeps increasing day by day. The purpose of this dataset is to allow fellow Scientists/…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
97606 instances - 5 features - 0 classes - 0 missing values
Context Data is collected daily from Our World in Data GitHub repository for covid-19, merged and uploaded. Content The data contains the following information: Country- this is the country for which…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
97606 instances - 5 features - classes - 0 missing values
Context While brainstorming ideas for a statistics project for a course last semester, the idea of utilizing data about microbreweries came up. Unfortunately after some exploration and thought, we…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2407 instances - 6 features - classes - 8 missing values
Context This is a continually updated dataset of professional fighters making fight predictions. Content The data is gathered mostly from James Lynch's YouTube channel, where fighters are asked to…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3401 instances - 6 features - classes - 0 missing values
What is World of Warcraft? According to Wikipedia: World of Warcraft (WoW) is a massively multiplayer online role-playing game (MMORPG) released in 2004 by Blizzard Entertainment. It is the fourth…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
478 instances - 6 features - classes - 1 missing values
Do two sentences come from the same article? We randomly sampled sentences from across Wikipedia. Some sentences came from the same articles, others do not. Sentences from the Same Article These two…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
129156 instances - 3 features - classes - 0 missing values
Dataset Information This dataset is from a 2014 survey that measures attitudes towards mental health and frequency of mental health disorders in the tech workplace. You are also encouraged to analyze…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1259 instances - 27 features - classes - 1892 missing values
About our Dataset The journey of collecting this Covid-19 India dataset began with a competition where we had to do sentiment analysis of tweets. The data was collected from…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
648958 instances - 4 features - classes - 10980 missing values
Context Medium is one of the most famous tools for spreading knowledge about almost any field. It is widely used to publish articles on ML, AI, and data science. This dataset is the collection of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
337 instances - 5 features - classes - 0 missing values
Context This dataset includes data from a random sample of 20,000 digital and 20,000 film-screen mammograms received by women age 60-89 years within the Breast Cancer Surveillance Consortium (BCSC)…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
39998 instances - 12 features - classes - 0 missing values
Source Acknowledgements: Data was collected from MPII Human Pose and transformed into a '.csv' file. Source License: Copyright (c) 2015, Max Planck Institute for Informatics All rights reserved.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4028 instances - 7 features - classes - 4028 missing values
Context We chose the Newegg website, the most common website providing computer hardware components. It offers hardware systems, PC parts, laptops, electronics, and more. Now Shipping to Saudi Arabia!…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2705 instances - 6 features - classes - 1565 missing values
A copy of PLK_Mini dataset from Meta album Set0
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3440 instances - 3 features - 86 classes - 3440 missing values
Data from https://doi.org/10.5281/zenodo.269636
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4758 instances - 39 features - classes - 0 missing values
analcatdata_happiness-pmlb
31 runs - 0 likes - 0 downloads - 0 reach - 0 impact
60 instances - 4 features - 3 classes - 0 missing values
No data.
291 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 7 classes - 0 missing values
No data.
309 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 35 features - 6 classes - 0 missing values
## **Meta-Album DIBaS Dataset (Extended)** The Digital Images of Bacteria Species dataset (DIBaS) (https://github.com/gallardorafael/DIBaS-Dataset) is a dataset of 33 bacterial species with around 20…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
4060 instances - 3 features - 33 classes - 4060 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3 instances - 2 features - classes - 0 missing values
Upstream data from the twin Archimedes screw hydro-electric generator on the river Thames at Caversham weir, Reading, UK.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
235 instances - 2 features - 0 classes - 0 missing values
Context This dataset was created by our in-house Web Scraping and Data Mining teams at PromptCloud and DataStock. You can download the full dataset here. This sample contains 30K records. Content…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
30000 instances - 16 features - classes - 49458 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
108 instances - 4 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
450 instances - 4 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
475 instances - 4 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
475 instances - 4 features - 0 classes - 0 missing values
17x17x2x2 tables of counts in GLIM-ready format used for the analyses in Biblarz, Timothy J., and Adrian E. Raftery. 1993. "The Effects of Family Disruption on Social Mobility." American Sociological…
6 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1156 instances - 6 features - 0 classes - 0 missing values
Data on educational transitions for a sample of 500 Irish schoolchildren aged 11 in 1967. The data were collected by Greaney and Kelleghan (1984), and reanalyzed by Raftery and Hout (1985, 1993). ###…
16174 runs - 0 likes - 0 downloads - 0 reach - 0 impact
500 instances - 6 features - 2 classes - 32 missing values
Datasets for `Pattern Recognition and Neural Networks' by B.D. Ripley. Cambridge University Press (1996) ISBN 0-521-46086-7 The…
41 runs - 0 likes - 0 downloads - 0 reach - 0 impact
27 instances - 3 features - 4 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
366 instances - 5 features - classes - 2 missing values
Dataset from `Pattern Recognition and Neural Networks' by B.D. Ripley. Cambridge University Press (1996) ISBN 0-521-46086-7. The background to the datasets is described in section 1.4; this file…
1105 runs - 0 likes - 0 downloads - 0 reach - 0 impact
250 instances - 3 features - 2 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100 instances - 10 features - classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100 instances - 3 features - 0 classes - 0 missing values
Human Development Index [DATA] United Nations Development Program compiled an Index of Human Development. Column 1: Country(character) 2: Index 3: GNP GNP PER CAPITA RANK RANK - RANK HDI 1987 GNP RANK…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
130 instances - 2 features - 0 classes - 0 missing values
The data consist of 2001 observations taken from a balloon about 30 kilometres above the surface of the earth. In the section of the flight shown here the balloon increases in height. As radiation…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2001 instances - 2 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
649 instances - 3 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Electricity usage is being treated as the…
4 runs - 0 likes - 0 downloads - 0 reach - 0 impact
55 instances - 3 features - 0 classes - 0 missing values
Tumor-size treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
286 instances - 10 features - 0 classes - 9 missing values
Data Sets for 'Regression Models for Time Series Analysis' by B. Kedem and K. Fokianos, Wiley 2002. Submitted by Kostas Fokianos (fokianos@ucy.ac.cy) [8/Nov/02] (176k) Note: - attribute names were…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
264 instances - 3 features - 0 classes - 0 missing values
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
50 instances - 3 features - classes - 0 missing values
Data originating from the book "Analyzing Categorical Data" by Jeffrey S. Simonoff.
1087 runs - 0 likes - 0 downloads - 0 reach - 0 impact
50 instances - 5 features - 2 classes - 0 missing values
Grass Grubs and Damage Ranking Data source: R. J. Townsend AgResearch, Lincoln, New Zealand Grass grubs are one of the major insect pests of pasture in Canterbury and can cause severe pasture damage…
988 runs - 0 likes - 0 downloads - 0 reach - 0 impact
155 instances - 9 features - 4 classes - 0 missing values
Data Sets for 'Regression Models for Time Series Analysis' by B. Kedem and K. Fokianos, Wiley 2002. Submitted by Kostas Fokianos (fokianos@ucy.ac.cy) [8/Nov/02] (176k) Note: - attribute names were…
599 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1024 instances - 3 features - 4 classes - 0 missing values
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
185 instances - 2 features - classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
123 runs - 0 likes - 0 downloads - 0 reach - 0 impact
46 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1088 runs - 0 likes - 0 downloads - 0 reach - 0 impact
132 instances - 4 features - 2 classes - 0 missing values
An artificial data set where instances belong to several clusters with a banana shape. There are two attributes At1 and At2 corresponding to the x and y axis, respectively. The class label (-1 and 1)…
163 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5300 instances - 3 features - 2 classes - 0 missing values
* Dataset Title: Wall-Following Robot Navigation Data Data Set (version with 2 Attributes) * Abstract: The data were collected as the SCITOS G5 robot navigates through the room following the wall in a…
109 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5456 instances - 3 features - 4 classes - 0 missing values
ARFF version of UCI dataset 'flags'. Creators: Collected primarily from the "Collins Gem Guide to Flags": Collins Publishers (1986). Donor: Richard S. Forsyth. Date 5/15/1990 This data file contains…
108 runs - 0 likes - 0 downloads - 0 reach - 0 impact
194 instances - 29 features - 8 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
119 runs - 0 likes - 0 downloads - 0 reach - 0 impact
39 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1164 runs - 0 likes - 0 downloads - 0 reach - 0 impact
222 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1038 runs - 0 likes - 0 downloads - 0 reach - 0 impact
147 instances - 7 features - 2 classes - 0 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
60 instances - 17 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
771 runs - 0 likes - 0 downloads - 0 reach - 0 impact
468 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1043 runs - 0 likes - 0 downloads - 0 reach - 0 impact
125 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
111 runs - 0 likes - 0 downloads - 0 reach - 0 impact
52 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
903 runs - 0 likes - 0 downloads - 0 reach - 0 impact
468 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
767 runs - 0 likes - 0 downloads - 0 reach - 0 impact
189 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
119 runs - 0 likes - 0 downloads - 0 reach - 0 impact
50 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
762 runs - 0 likes - 0 downloads - 0 reach - 0 impact
88 instances - 3 features - 2 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
731 runs - 0 likes - 0 downloads - 0 reach - 0 impact
151 instances - 7 features - 3 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
116640 instances - 10 features - 0 classes - 0 missing values
Case number deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning…
10 runs - 0 likes - 0 downloads - 0 reach - 0 impact
195 instances - 11 features - 0 classes - 2 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
140 runs - 0 likes - 0 downloads - 0 reach - 0 impact
194 instances - 29 features - 2 classes - 0 missing values
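The multi-class binarization described in the entry above re-labels the majority class as positive ('P'). A minimal sketch of that step follows, assuming the target is a plain Python list. The function name `binarize_by_majority` and the 'N' label for every other class are assumptions, since the description is truncated before it says how the remaining classes are labeled.

```python
from collections import Counter


def binarize_by_majority(targets, positive="P", negative="N"):
    """Re-label the most frequent class as `positive`.

    Assumption: every other class is mapped to `negative`; the 'N'
    label is hypothetical. Ties go to the class encountered first.
    """
    majority_class, _count = Counter(targets).most_common(1)[0]
    return [positive if t == majority_class else negative for t in targets]


# Example: "a" occurs three times, so it becomes the positive class.
labels = binarize_by_majority(["a", "b", "a", "c", "a"])
print(labels)
```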
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
842 runs - 0 likes - 0 downloads - 0 reach - 0 impact
155 instances - 9 features - 2 classes - 0 missing values
Abstract: The data set is composed of 60 chorales (5665 events) by J.S. Bach (1685-1750). Each event of each chorale is labelled using 1 among 101 chord labels and described through 14 features.…
31 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5665 instances - 17 features - 102 classes - 0 missing values
User profile data for San Francisco OkCupid users published in [Kim, A. Y., & Escobedo-Land, A. (2015). OKCupid data for introductory statistics and data science courses. Journal of Statistics…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
50789 instances - 20 features - 3 classes - 154107 missing values
The dataset freMTPL2sev contains claim amounts for 26,639 motor third-party liability policies.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
26639 instances - 2 features - classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 5 features - 2 classes - 0 missing values
Historical Rainfall data of Bangladesh
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16755 instances - 4 features - 0 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 5 features - 2 classes - 0 missing values
__Major changes w.r.t. version 2: ignored variable 3 in this upload as this seems to be a perfect predictor.__ Tamilnadu Electricity Board Hourly Readings dataset. Real-time readings were collected…
0 runs - 0 likes - 2 downloads - 2 reach - 19 impact
45781 instances - 4 features - 20 classes - 0 missing values
This file holds global land temperatures by country
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
577462 instances - 4 features - classes - 64563 missing values
Holds information on average temperature per country
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
577462 instances - 4 features - classes - 64563 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 5 features - 2 classes - 0 missing values
Context We publish the data to clarify the real evolution of forest area in post-communist Romania. The data is from the National Statistics Institute of Romania, so these are the official reported…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8111 instances - 4 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10578 instances - 8 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 5 features - 2 classes - 0 missing values
Subsampling of the dataset okcupid-stem (42734) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 20 features - 3 classes - 5980 missing values
Subsampling of the dataset okcupid-stem (42734) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 20 features - 3 classes - 6117 missing values
Subsampling of the dataset okcupid-stem (42734) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 20 features - 3 classes - 6105 missing values