Data
Filter results by:
This data is derived from the 2012 KDD Cup. The data is subsampled to 0.1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
63421 runs0 likes0 downloads0 reach0 impact
39948 instances - 10 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
17496 instances - 10 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
59049 instances - 10 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
116640 instances - 10 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 19 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 26 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 13 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
177147 instances - 11 features - 0 classes - 0 missing values
No data.
90 runs0 likes0 downloads0 reach0 impact
663552 instances - 13 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
144 instances - 77 features - 0 classes - 0 missing values
eating
9413 runs0 likes0 downloads0 reach0 impact
945 instances - 6374 features - 7 classes - 0 missing values
No data.
28 runs0 likes0 downloads0 reach0 impact
1000000 instances - 70 features - 24 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
30 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
31 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
31 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
30 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
34 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
28 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
* Dataset Title: MicroMass - Mixed (mixed spectra version) * Abstract: A dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data. * Source:…
64 runs0 likes0 downloads0 reach0 impact
360 instances - 1301 features - 10 classes - 0 missing values
* Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779 * Data Set Information: This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In…
159 runs0 likes0 downloads0 reach0 impact
200 instances - 14 features - 5 classes - 0 missing values
* Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779 * Data Set Information: This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In…
170 runs0 likes0 downloads0 reach0 impact
123 instances - 13 features - 5 classes - 0 missing values
* Title: seismic-bumps Data Set * Abstract: The data describe the problem of high energy (higher than 10^4 J) seismic bumps forecasting in a coal mine. Data come from two of longwalls located in a…
152 runs0 likes0 downloads0 reach0 impact
210 instances - 8 features - 3 classes - 0 missing values
Tattile Via Gaetano Donizetti, 1-3-5,25030 Mairano (Brescia), Italy. ### Dataset Description Semeion Handwritten Digit Data Set, where 1593 handwritten digits from around 80 persons were scanned and…
33402 runs0 likes0 downloads0 reach0 impact
1593 instances - 257 features - 10 classes - 0 missing values
* Title: Skin Segmentation Data Set * Abstract: The Skin Segmentation dataset is constructed over B, G, R color space. Skin and Nonskin dataset is generated using skin textures from face images of…
15 runs0 likes0 downloads0 reach0 impact
245057 instances - 4 features - 2 classes - 0 missing values
The data were collected as the SCITOS G5 robot navigates through the room following the wall in a clockwise direction, for 4 rounds, using 24 ultrasound sensors arranged circularly around its 'waist'.…
25664 runs0 likes0 downloads0 reach0 impact
5456 instances - 25 features - 4 classes - 0 missing values
* Title: South Africa Heart Disease Dataset * Description A retrospective sample of males in a heart-disease high-risk region of the Western Cape, South Africa. There are roughly two controls per case…
155 runs0 likes0 downloads0 reach0 impact
462 instances - 10 features - 2 classes - 0 missing values
* Title: seeds Data Set * Abstract: Measurements of geometrical properties of kernels belonging to three different varieties of wheat. A soft X-ray technique and GRAINS package were used to construct…
190 runs0 likes0 downloads0 reach0 impact
210 instances - 8 features - 3 classes - 0 missing values
* Twonorm dataset This is an implementation of Leo Breiman's twonorm example[1]. It is a 20 dimensional, 2 class classification example. Each class is drawn from a multivariate normal distribution…
118 runs0 likes0 downloads0 reach0 impact
7400 instances - 21 features - 2 classes - 0 missing values
* Title: User Knowledge Modeling Data Set * Abstract: It is the real dataset about the students' knowledge status about the subject of Electrical DC Machines. The dataset had been obtained from Ph.D.…
153 runs0 likes0 downloads0 reach0 impact
403 instances - 6 features - 5 classes - 0 missing values
The dataset collects data from an Android smartphone positioned in the chest pocket. Accelerometer Data are collected from 22 participants walking in the wild over a predefined path. The dataset is…
80 runs0 likes0 downloads0 reach0 impact
149332 instances - 5 features - 22 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code ID's were removed. ! Note that there is also a related Breast Cancer Wisconsin (Original) Data Set with a different set of…
226893 runs0 likes0 downloads0 reach0 impact
569 instances - 31 features - 2 classes - 0 missing values
* Title: Wholesale customers Data Set * Abstract: The data set refers to clients of a wholesale distributor. It includes the annual spending in monetary units (m.u.) on diverse product categories *…
161 runs0 likes0 downloads0 reach0 impact
440 instances - 9 features - 2 classes - 0 missing values
A dataset of steel plates' faults, classified into 7 different types. The goal was to train machine learning for automatic pattern recognition. The dataset consists of 27 features describing each…
277767 runs2 likes52 downloads54 reach26 impact
1941 instances - 34 features - 2 classes - 0 missing values
* Title: Thoracic Surgery Data Data Set * Abstract: The data is dedicated to classification problem related to the post-operative life expectancy in the lung cancer patients: class 1 - death within…
145 runs0 likes0 downloads0 reach0 impact
470 instances - 17 features - 2 classes - 0 missing values
* Title of Database: Spoken Arabic Digit * Abstract: This dataset contains time series of mel-frequency cepstrum coefficients (MFCCs) corresponding to spoken Arabic digits. Includes data from 44 males…
1 runs0 likes0 downloads0 reach0 impact
263256 instances - 15 features - 10 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
697641 instances - 47237 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: Vikas Sindhwani for the SVMlin project.
0 runs0 likes0 downloads0 reach0 impact
72309 instances - 20959 features - 0 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-bodies * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set Information:…
3 runs0 likes0 downloads0 reach0 impact
64 instances - 4703 features - 2 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-subjects-stemmed * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set…
71 runs0 likes0 downloads0 reach0 impact
64 instances - 230 features - 2 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-subjects * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set…
40 runs0 likes0 downloads0 reach0 impact
64 instances - 243 features - 2 classes - 0 missing values
A 3-class version of Cardiotocography dataset.
134 runs0 likes0 downloads0 reach0 impact
2126 instances - 36 features - 3 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-bodies-stemmed * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set…
0 runs0 likes0 downloads0 reach0 impact
64 instances - 3722 features - 2 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-700 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of…
4537 runs0 likes0 downloads0 reach0 impact
700 instances - 13 features - 3 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-cpd1-500 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity…
7145 runs0 likes0 downloads0 reach0 impact
500 instances - 13 features - 5 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au6-1000 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of…
11010 runs0 likes16 downloads16 reach47 impact
1000 instances - 41 features - 8 classes - 0 missing values
* Abstract: The data was created by a medical expert as a data set to test the expert system, which will perform the presumptive diagnosis of two diseases of the urinary system. This is a…
423 runs0 likes0 downloads0 reach0 impact
120 instances - 7 features - 2 classes - 0 missing values
* Abstract: Purpose is to predict poker hands * Source - Creators: Robert Cattral (cattral '@' gmail.com) Franz Oppacher (oppacher '@' scs.carleton.ca) Carleton University, Department of Computer…
1 runs0 likes0 downloads0 reach0 impact
1025009 instances - 11 features - 10 classes - 0 missing values
* Title: Nursery Database * Abstract: 4-class version of the original Nursery dataset
121 runs0 likes0 downloads0 reach0 impact
12958 instances - 9 features - 4 classes - 0 missing values
* Abstract: 9-class version of poker-hand dataset, it was removed the minority class.
1 runs0 likes0 downloads0 reach0 impact
1025000 instances - 11 features - 9 classes - 0 missing values
* Dataset: This is a reprocessed version of heart-h (hungarian), the heart disease reprocessed hungarian dataset from UCI.
138 runs0 likes0 downloads0 reach0 impact
294 instances - 14 features - 5 classes - 0 missing values
* Dataset: Hill valley dataset. A noiseless version of the data set.
117 runs0 likes0 downloads0 reach0 impact
1212 instances - 101 features - 2 classes - 0 missing values
* Abstract: A 3-class version of abalone dataset. * Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and…
176 runs0 likes0 downloads0 reach0 impact
4177 instances - 9 features - 3 classes - 0 missing values
* Dataset: Reduced version (10 % of the examples) of bank-marketing dataset.
1259 runs0 likes0 downloads0 reach0 impact
4521 instances - 17 features - 2 classes - 0 missing values
A 4-class version of breast-tissue dataset.
299 runs0 likes0 downloads0 reach0 impact
106 instances - 10 features - 4 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: transform to two-class
0 runs0 likes0 downloads0 reach0 impact
862 instances - 3 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach17 impact
1000 instances - 25 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: scaled to [-1,1]
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - 0 classes - 0 missing values
libSVM","AAD group IJCNN 2001 neural network competition. Slide presentation in IJCNN'01, Ford Research Laboratory, 2001. http://www.geocities.com/ijcnn/nnc_ijcnn01.pdf . #Dataset from the LIBSVM data…
0 runs0 likes0 downloads0 reach0 impact
191681 instances - 23 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12507, and it has 314 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
314 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11453, and it has 57 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
57 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100621, and it has 24 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
24 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 234, and it has 2145 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
2145 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11694, and it has 157 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
157 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103169, and it has 10 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
10 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100975, and it has 75 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
75 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100671, and it has 25 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
25 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 191, and it has 4442 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
4442 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 226, and it has 1431 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
1431 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11156, and it has 838 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
838 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 102421, and it has 74 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
74 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11024, and it has 2329 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
2329 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10475, and it has 1030 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1030 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 65, and it has 1515 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
1515 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12959, and it has 72 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
72 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101386, and it has 148 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
148 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100835, and it has 125 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
125 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10378, and it has 1330 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1330 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17081, and it has 440 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
440 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10781, and it has 2044 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
2044 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10849, and it has 1580 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1580 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101508, and it has 532 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
532 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100843, and it has 16 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
16 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11631, and it has 1255 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1255 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10280, and it has 3134 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
3134 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11574, and it has 230 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
230 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 14071, and it has 30 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
30 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12238, and it has 30 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
30 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100426, and it has 123 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
123 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 275, and it has 477 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
477 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20157, and it has 63 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
63 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12000, and it has 366 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
366 instances - 1026 features - 0 classes - 0 missing values