OpenML
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10908, and it has 13 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
13 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 214, and it has 770 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
770 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11635, and it has 1003 rows and 1026 features…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
1003 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12861, and it has 235 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
235 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12635, and it has 85 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
85 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 19624, and it has 52 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
52 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 19907, and it has 13 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
13 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30042, and it has 81 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
81 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101404, and it has 121 rows and 1026 features…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
121 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10651, and it has 235 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
235 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12214, and it has 330 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
330 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12469, and it has 102 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
102 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101317, and it has 99 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
99 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 19607, and it has 143 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
143 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 117, and it has 818 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
818 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11536, and it has 1306 rows and 1026 features…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
1306 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10889, and it has 14 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10677, and it has 59 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
59 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11639, and it has 201 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
201 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20128, and it has 100 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
100 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 69, and it has 569 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
569 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100098, and it has 472 rows and 1026 features…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
472 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12825, and it has 847 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
847 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10613, and it has 247 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
247 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11626, and it has 507 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
507 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11843, and it has 84 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
84 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10323, and it has 97 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
97 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11947, and it has 17 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
17 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 88, and it has 895 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
895 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10131, and it has 1979 rows and 1026 features…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
1979 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10542, and it has 351 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
351 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 102399, and it has 82 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
82 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10235, and it has 93 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
93 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11336, and it has 1199 rows and 1026 features…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
1199 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10579, and it has 388 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
388 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103088, and it has 13 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
13 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30026, and it has 553 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
553 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30013, and it has 244 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
244 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10926, and it has 62 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
62 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 215, and it has 726 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
726 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12268, and it has 1352 rows and 1026 features…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
1352 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 106, and it has 1236 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
1236 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100042, and it has 14 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
14 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 102718, and it has 31 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
31 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 51, and it has 3356 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
3356 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10230, and it has 19 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
19 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11430, and it has 614 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
614 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101058, and it has 594 rows and 1026 features…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
594 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 163, and it has 2570 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
2570 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10426, and it has 27 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
27 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 48, and it has 551 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
551 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100995, and it has 785 rows and 1026 features…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
785 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17147, and it has 18 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
18 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 102711, and it has 74 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
74 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100075, and it has 145 rows and 1026 features…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
145 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 103453, and it has 73 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
73 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11575, and it has 1490 rows and 1026 features…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
1490 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101073, and it has 73 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
73 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10207, and it has 89 rows and 1026 features (including…
1 run - 0 likes - 0 downloads - 0 reach - 0 impact
89 instances - 1026 features - 0 classes - 0 missing values
Source: 1. Olcay KURSUN, PhD., Istanbul University, Department of Computer Engineering, 34320, Istanbul, Turkey Phone: +90 (212) 473 7070 - 17827 Email: okursun '@' istanbul.edu.tr 2. Betul ERDOGDU…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1039 instances - 29 features - classes - 0 missing values
### Description Cylinder bands UCI dataset - Process delays known as cylinder banding in rotogravure printing were substantially mitigated using control rules discovered by decision tree induction.…
21778 runs - 0 likes - 0 downloads - 0 reach - 0 impact
540 instances - 40 features - 2 classes - 999 missing values
Data on tree growth used in the Case Study published in the September 1995 issue of the Canadian Journal of Statistics. This data set has been provided by Dr. Fernando Camacho, Ontario Hydro…
18757 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2796 instances - 35 features - 6 classes - 68100 missing values
Source: Creators : François Kawala (1,2) Ahlame Douzal (1) Eric Gaussier (1) Eustache Diemert (2) Institutions : (1) Université Joseph Fourier (Grenoble I) Laboratoire d'informatique de…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
28179 instances - 97 features - classes - 0 missing values
Abstract: This data-set contains examples of buzz events from two different social networks: Twitter, and Tom's Hardware, a forum network focusing on new technology with more conservative dynamics.…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
583250 instances - 78 features - 0 classes - 0 missing values
Abstract: CART book's waveform domains Source: Original Owners: Breiman,L., Friedman,J.H., Olshen,R.A., & Stone,C.J. (1984). Classification and Regression Trees. Wadsworth International Group:…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 22 features - classes - 0 missing values
Abstract: The data set is composed of 60 chorales (5665 events) by J.S. Bach (1685-1750). Each event of each chorale is labelled using 1 among 101 chord labels and described through 14 features.…
31 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5665 instances - 17 features - 102 classes - 0 missing values
Source: Original Owner: U.S. Census Bureau http://www.census.gov/ United States Department of Commerce Donor: Terran Lane and Ronny Kohavi Data Mining and Visualization Silicon Graphics. terran '@'…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
299285 instances - 42 features - classes - 0 missing values
Gestures from Rest Positions. In: Symposium on Applied Computing (SAC), 2013, Coimbra. Proceedings of the 28th Annual ACM Symposium on Applied Computing (SAC), 2013. p. 46-52. Data Set Information:…
74 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Predicting the Geographical Origin of Music, ICDM, 2014 Abstract: Instances in this dataset contain audio features extracted from 1059 wave files. The task associated with the data is to predict the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1059 instances - 118 features - 0 classes - 0 missing values
This dataset summarizes a heterogeneous set of features about articles published by Mashable in a period of two years. The goal is to predict the number of shares in social networks (popularity). *…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
39644 instances - 61 features - 0 classes - 0 missing values
USDA, NRCS. 2008. The PLANTS Database ([Web Link], 31 December 2008). National Plant Data Center, Baton Rouge, LA 70874-4490 USA. Abstract: Data has been extracted from the USDA plants database. It…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Abstract: This data set contains a total of 5820 evaluation scores provided by students from Gazi University in Ankara (Turkey). There is a total of 28 course specific questions and additional 5…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5820 instances - 33 features - classes - 0 missing values
Abstract: This data contains general demographic information on internet users in 1997. Source: Original Owner: Graphics, Visualization, & Usability Center College of Computing Georgia Institute of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Abstract: This dataset contains timeseries of mel-frequency cepstrum coefficients (MFCCs) corresponding to spoken Arabic digits. Includes data from 44 male and 44 female native Arabic speakers.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
178526 instances - 13 features - classes - 57200 missing values
## Guess which points belong to signal track [COMET](http://comet.kek.jp/Introduction.html) is an experiment being constructed at the J-PARC proton beam laboratory in Japan. It will search for…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7619400 instances - 6 features - 0 classes - 0 missing values
And another sample. (v. 2 without OpenML metainfo)
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
89640 instances - 6 features - classes - 0 missing values
Sample with OpenML metadata.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
761940 instances - 6 features - 0 classes - 0 missing values
YAGO Schema.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
181 instances - 4 features - classes - 0 missing values
## Guess which points belong to signal track [COMET](http://comet.kek.jp/Introduction.html) is an experiment being constructed at the J-PARC proton beam laboratory in Japan. It will search for…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7619400 instances - 6 features - 0 classes - 0 missing values
## Guess which points belong to signal track [COMET](http://comet.kek.jp/Introduction.html) is an experiment being constructed at the J-PARC proton beam laboratory in Japan. It will search for…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7619400 instances - 6 features - 0 classes - 0 missing values
This is sensor data for testing; it is not complete.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
127591 instances - 27 features - classes - 0 missing values
Sampled http://www.openml.org/d/5889
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
761940 instances - 6 features - classes - 0 missing values
Another sample of COMET_MC
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
89640 instances - 6 features - 0 classes - 0 missing values
Multi-label dataset. The genbase dataset contains protein sequences that can be assigned to several classes of protein families.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
662 instances - 1212 features - classes - 0 missing values
Multi-label dataset. The image benchmark dataset consists of 2000 natural scene images. Zhou and Zhang (2007) extracted 135 features for each image and made it publicly available as processed image…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 140 features - classes - 0 missing values
Multi-label dataset. Audio dataset (emotions) consists of 593 musical files with 6 clustered emotional labels and 72 predictors. Each song can be labeled with one or more of the labels…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
593 instances - 78 features - classes - 0 missing values
Multi-label dataset. The UC Berkeley enron4 dataset represents a subset of the original enron5 dataset and consists of 1684 cases of emails with 21 labels and 1001 predictor variables.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1702 instances - 1054 features - classes - 0 missing values
The langLog dataset includes 1004 textual predictors and was originally compiled in the doctoral thesis of Read (2010). It consists of 956 text samples that can be assigned to one or more topics such…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1460 instances - 1079 features - classes - 0 missing values
Multi-label dataset. A subset of the reuters dataset includes 2000 observations for text classification.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 250 features - classes - 0 missing values
Multi-label dataset. The scene dataset is an image classification task where labels like Beach, Mountain, Field, Urban are assigned to each image.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2407 instances - 300 features - classes - 0 missing values
User profile data for San Francisco OkCupid users published in [Kim, A. Y., & Escobedo-Land, A. (2015). OKCupid data for introductory statistics and data science courses. Journal of Statistics…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
50789 instances - 20 features - 3 classes - 154107 missing values
It has 3 attributes (ID, tweet, label) and 91299 tweets: 39998 non-sarcastic tweets and 51300 sarcastic tweets.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
91298 instances - 2 features - 0 classes - 0 missing values
Multi-label dataset. The birds dataset consists of 327 audio recordings of 12 different vocalizing bird species. Each sound can be assigned to various bird species.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
645 instances - 279 features - classes - 0 missing values
This data was collected from combined primary and secondary sources, through questionnaires, verbal interviews, and part of the hospital’s record department’s data, from the selected…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
281 instances - 98 features - 2 classes - 2 missing values
This is the same data as version 5 (OpenML ID = 1220) with '_id' features coded as nominal factor variables.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
39948 instances - 12 features - 2 classes - 0 missing values
Sensor data measurements of one Boiler, containing WaterInput/SteamOutput (flow, temperature, pressure) for one month, which is measured every minute.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
44643 instances - 8 features - classes - 44643 missing values
These weekly averages are ultimately based on measurements of 4 air samples per hour taken atop intake lines on several towers during steady periods of CO2 concentration of not less than 6 hours per…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2225 instances - 7 features - 0 classes - 0 missing values
Of all the universities in the world, which are the best? Ranking universities is a difficult, political, and controversial practice. There are hundreds of different national and international…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1029 instances - 14 features - classes - 200 missing values
Los Angeles ozone pollution data, 1976
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
Klaverjas is an example of the Jack-Nine card games, which are characterized as trick-taking games where the Jack and nine of the trump suit are the highest-ranking trumps, and the tens and aces…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
981541 instances - 33 features - 2 classes - 0 missing values