OpenML
Filter results by:
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103071, and it has 150 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
150 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101252, and it has 33 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
33 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17076, and it has 51 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
51 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10244, and it has 396 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
396 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 260, and it has 18 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
18 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10373, and it has 53 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
53 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10143, and it has 62 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
62 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103910, and it has 14 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
14 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12345, and it has 42 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
42 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 246, and it has 1135 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
1135 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 168, and it has 412 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
412 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101552, and it has 73 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
73 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12673, and it has 183 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
183 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12666, and it has 2243 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
2243 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 247, and it has 1171 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
1171 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11691, and it has 715 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
715 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 14037, and it has 4378 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
4378 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30017, and it has 1058 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1058 instances - 1026 features - 0 classes - 0 missing values
No data.
60 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10190, and it has 66 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
66 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 19905, and it has 3048 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
3048 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10880, and it has 955 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
955 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10167, and it has 89 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
89 instances - 1026 features - 0 classes - 0 missing values
No data.
37 runs0 likes0 downloads0 reach0 impact
1000000 instances - 70 features - 24 classes - 0 missing values
No data.
33 runs0 likes0 downloads0 reach0 impact
1000000 instances - 70 features - 24 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 33 features - 0 classes - 0 missing values
No data.
313 runs0 likes0 downloads0 reach0 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
47 runs0 likes0 downloads0 reach0 impact
1000000 instances - 45 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
78732 instances - 11 features - 0 classes - 0 missing values
No data.
288 runs0 likes0 downloads0 reach0 impact
1000000 instances - 15 features - 9 classes - 0 missing values
Data on predicting clicks on ads in a search engine.
0 runs0 likes0 downloads0 reach0 impact
1496391 instances - 10 features - 2 classes - 0 missing values
Even smaller sample of version 1
0 runs0 likes0 downloads0 reach0 impact
149639 instances - 12 features - 2 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
313 runs0 likes0 downloads0 reach0 impact
399482 instances - 12 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
531441 instances - 12 features - 0 classes - 0 missing values
No data.
51 runs0 likes0 downloads0 reach0 impact
1000000 instances - 15 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 0 classes - 0 missing values
No data.
311 runs0 likes0 downloads0 reach0 impact
1000000 instances - 10 features - 2 classes - 0 missing values
No data.
307 runs0 likes0 downloads0 reach0 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
306 runs0 likes0 downloads0 reach0 impact
1000000 instances - 4 features - 2 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 0.1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
63421 runs0 likes0 downloads0 reach0 impact
39948 instances - 10 features - 2 classes - 0 missing values
No data.
68 runs0 likes0 downloads0 reach0 impact
20000 instances - 17 features - 3 classes - 10000 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
17496 instances - 10 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
59049 instances - 10 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
116640 instances - 10 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 19 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 26 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 13 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
177147 instances - 11 features - 0 classes - 0 missing values
No data.
90 runs0 likes0 downloads0 reach0 impact
663552 instances - 13 features - 2 classes - 0 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
0 runs0 likes0 downloads0 reach0 impact
798964 instances - 10 features - 3 classes - 399482 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
144 instances - 77 features - 0 classes - 0 missing values
eating
9413 runs0 likes0 downloads0 reach0 impact
945 instances - 6374 features - 7 classes - 0 missing values
No data.
28 runs0 likes0 downloads0 reach0 impact
1000000 instances - 70 features - 24 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
30 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
31 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
31 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
29 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
28 runs0 likes0 downloads0 reach0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
30 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
34 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
28 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
* Dataset Title: MicroMass - Mixed (mixed spectra version) * Abstract: A dataset to explore machine learning approaches for the identification of microorganisms from mass-spectrometry data. * Source:…
64 runs0 likes0 downloads0 reach0 impact
360 instances - 1301 features - 10 classes - 0 missing values
* Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779 * Data Set Information: This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In…
159 runs0 likes0 downloads0 reach0 impact
200 instances - 14 features - 5 classes - 0 missing values
* Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779 * Data Set Information: This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In…
170 runs0 likes0 downloads0 reach0 impact
123 instances - 13 features - 5 classes - 0 missing values
* Title: seismic-bumps Data Set * Abstract: The data describe the problem of high energy (higher than 10^4 J) seismic bumps forecasting in a coal mine. Data come from two of longwalls located in a…
152 runs0 likes0 downloads0 reach0 impact
210 instances - 8 features - 3 classes - 0 missing values
Tattile Via Gaetano Donizetti, 1-3-5,25030 Mairano (Brescia), Italy. ### Dataset Description Semeion Handwritten Digit Data Set, where 1593 handwritten digits from around 80 persons were scanned and…
33402 runs0 likes0 downloads0 reach0 impact
1593 instances - 257 features - 10 classes - 0 missing values
* Title: Skin Segmentation Data Set * Abstract: The Skin Segmentation dataset is constructed over B, G, R color space. Skin and Nonskin dataset is generated using skin textures from face images of…
15 runs0 likes0 downloads0 reach0 impact
245057 instances - 4 features - 2 classes - 0 missing values
The data were collected as the SCITOS G5 robot navigates through the room following the wall in a clockwise direction, for 4 rounds, using 24 ultrasound sensors arranged circularly around its 'waist'.…
25664 runs0 likes0 downloads0 reach0 impact
5456 instances - 25 features - 4 classes - 0 missing values
* Title: South Africa Heart Disease Dataset * Description A retrospective sample of males in a heart-disease high-risk region of the Western Cape, South Africa. There are roughly two controls per case…
155 runs0 likes0 downloads0 reach0 impact
462 instances - 10 features - 2 classes - 0 missing values
* Title: seeds Data Set * Abstract: Measurements of geometrical properties of kernels belonging to three different varieties of wheat. A soft X-ray technique and GRAINS package were used to construct…
190 runs0 likes0 downloads0 reach0 impact
210 instances - 8 features - 3 classes - 0 missing values
* Twonorm dataset This is an implementation of Leo Breiman's twonorm example[1]. It is a 20 dimensional, 2 class classification example. Each class is drawn from a multivariate normal distribution…
118 runs0 likes0 downloads0 reach0 impact
7400 instances - 21 features - 2 classes - 0 missing values
* Title: User Knowledge Modeling Data Set * Abstract: It is the real dataset about the students' knowledge status about the subject of Electrical DC Machines. The dataset had been obtained from Ph.D.…
153 runs0 likes0 downloads0 reach0 impact
403 instances - 6 features - 5 classes - 0 missing values
The dataset collects data from an Android smartphone positioned in the chest pocket. Accelerometer Data are collected from 22 participants walking in the wild over a predefined path. The dataset is…
80 runs0 likes0 downloads0 reach0 impact
149332 instances - 5 features - 22 classes - 0 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code ID's were removed. ! Note that there is also a related Breast Cancer Wisconsin (Original) Data Set with a different set of…
226893 runs0 likes0 downloads0 reach0 impact
569 instances - 31 features - 2 classes - 0 missing values
* Title: Wholesale customers Data Set * Abstract: The data set refers to clients of a wholesale distributor. It includes the annual spending in monetary units (m.u.) on diverse product categories *…
161 runs0 likes0 downloads0 reach0 impact
440 instances - 9 features - 2 classes - 0 missing values
* Title: Thoracic Surgery Data Data Set * Abstract: The data is dedicated to classification problem related to the post-operative life expectancy in the lung cancer patients: class 1 - death within…
145 runs0 likes0 downloads0 reach0 impact
470 instances - 17 features - 2 classes - 0 missing values
* Title of Database: Spoken Arabic Digit * Abstract: This dataset contains time series of mel-frequency cepstrum coefficients (MFCCs) corresponding to spoken Arabic digits. Includes data from 44 males…
1 runs0 likes0 downloads0 reach0 impact
263256 instances - 15 features - 10 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
697641 instances - 47237 features - 0 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: Vikas Sindhwani for the SVMlin project.
0 runs0 likes0 downloads0 reach0 impact
72309 instances - 20959 features - 0 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-bodies * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set Information:…
3 runs0 likes0 downloads0 reach0 impact
64 instances - 4703 features - 2 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-subjects-stemmed * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set…
71 runs0 likes0 downloads0 reach0 impact
64 instances - 230 features - 2 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-subjects * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set…
40 runs0 likes0 downloads0 reach0 impact
64 instances - 243 features - 2 classes - 0 missing values
A 3-class version of Cardiotocography dataset.
134 runs0 likes0 downloads0 reach0 impact
2126 instances - 36 features - 3 classes - 0 missing values
* Dataset: DBworld e-mails data set Task: dbworld-bodies-stemmed * Source: Michele Filannino, PhD University of Manchester Centre for Doctoral Training Email: filannim_AT_cs.man.ac.uk * Data Set…
0 runs0 likes0 downloads0 reach0 impact
64 instances - 3722 features - 2 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-700 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity of…
4537 runs0 likes0 downloads0 reach0 impact
700 instances - 13 features - 3 classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-cpd1-500 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and heterogeneity…
7145 runs0 likes0 downloads0 reach0 impact
500 instances - 13 features - 5 classes - 0 missing values
* Abstract: The data was created by a medical expert as a data set to test the expert system, which will perform the presumptive diagnosis of two diseases of the urinary system. This is a…
423 runs0 likes0 downloads0 reach0 impact
120 instances - 7 features - 2 classes - 0 missing values
* Abstract: Purpose is to predict poker hands * Source - Creators: Robert Cattral (cattral '@' gmail.com) Franz Oppacher (oppacher '@' scs.carleton.ca) Carleton University, Department of Computer…
1 runs0 likes0 downloads0 reach0 impact
1025009 instances - 11 features - 10 classes - 0 missing values
* Title: Nursery Database * Abstract: 4-class version of the original Nursery dataset
121 runs0 likes0 downloads0 reach0 impact
12958 instances - 9 features - 4 classes - 0 missing values
* Abstract: 9-class version of poker-hand dataset, it was removed the minority class.
1 runs0 likes0 downloads0 reach0 impact
1025000 instances - 11 features - 9 classes - 0 missing values
* Dataset: This is a reprocessed version of heart-h (hungarian), the heart disease reprocessed hungarian dataset from UCI.
138 runs0 likes0 downloads0 reach0 impact
294 instances - 14 features - 5 classes - 0 missing values
* Dataset: Hill valley dataset. A noiseless version of the data set.
117 runs0 likes0 downloads0 reach0 impact
1212 instances - 101 features - 2 classes - 0 missing values
* Abstract: A 3-class version of abalone dataset. * Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and…
176 runs0 likes0 downloads0 reach0 impact
4177 instances - 9 features - 3 classes - 0 missing values