OpenML
Filter results by:
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
853 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
746 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
773 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
755 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
817 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
847 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
77 runs0 likes0 downloads0 reach0 impact
250 instances - 10936 features - 2 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
854 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
782 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
806 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
791 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 2 classes - 0 missing values
* Abstract: Predict the Bankruptcy from Qualitative parameters from experts. * Source: Source Information -- Creator : Mr.A.Martin(jayamartin '@' yahoo.com) Mr.J.Uthayakumar (uthayakumar17691 '@'…
147 runs0 likes0 downloads0 reach0 impact
250 instances - 7 features - 2 classes - 0 missing values
Context This Data set Contains the values of Stock of Amazon which dates between 15 July ,2019 to 15 July,2020 Content The Data set contains 7 different columns that includes Date, The opening value…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 8 features - classes - 23 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 101 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
822 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
748 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
764 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
786 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
773 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
775 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 2 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 0 classes - 0 missing values
The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
791 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
812 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 175, and it has 251 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
251 instances - 1026 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
769 runs0 likes0 downloads0 reach0 impact
252 instances - 15 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
252 instances - 14 features - classes - 0 missing values
Short Summary: Lists estimates of the percentage of body fat determined by underwater weighing and various body circumference measurements for 252 men. Classroom use of this data set: This data set…
54 runs0 likes0 downloads0 reach0 impact
252 instances - 15 features - 0 classes - 0 missing values
Mean While 1
0 runs0 likes0 downloads0 reach0 impact
253 instances - 38 features - 2 classes - 0 missing values
Mega watt
183 runs0 likes0 downloads0 reach0 impact
253 instances - 38 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
253 instances - 15155 features - 2 classes - 0 missing values
The coronavirus pandemic has affected the entire world and many families have been destroyed. The stock exchange was also affected, but vaccine companies took advantage of this moment and leveraged…
0 runs0 likes0 downloads0 reach0 impact
255 instances - 36 features - classes - 35 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20158, and it has 257 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
257 instances - 1026 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
66 runs0 likes0 downloads0 reach0 impact
259 instances - 10936 features - 2 classes - 0 missing values
Gross Domestic Product (GDP) is the monetary value of all finished goods and services made within a country during a specific period. GDP provides an economic snapshot of a country, used to estimate…
0 runs0 likes0 downloads0 reach0 impact
260 instances - 32 features - classes - 1074 missing values
This dataset includes demographic data for users who have rated the top 15 most-rated movies, ranked based on a star rating system.
0 runs0 likes0 downloads0 reach0 impact
260 instances - 79 features - classes - 0 missing values
This dataset includes demographic data for users who have rated the top 15 most-rated movies, ranked based on a star rating system.
0 runs0 likes0 downloads0 reach0 impact
260 instances - 79 features - classes - 0 missing values
Context Content A few years ago, the United States District Court of Houston had a case that arises under Title VII of the Civil Rights Act of 1964, 42 U.S.C. 200e et seq. The plaintiffs in this case…
0 runs0 likes0 downloads0 reach0 impact
261 instances - 10 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10981, and it has 262 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
262 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10679, and it has 262 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
262 instances - 1026 features - 0 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
263 instances - 5 features - classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
886 runs0 likes0 downloads0 reach0 impact
264 instances - 5 features - 2 classes - 0 missing values
Data Sets for 'Regression Models for Time Series Analysis' by B. Kedem and K. Fokianos, Wiley 2002. Submitted by Kostas Fokianos (fokianos@ucy.ac.cy) [8/Nov/02] (176k) Note: - attribute names were…
2 runs0 likes0 downloads0 reach0 impact
264 instances - 3 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1059 runs0 likes0 downloads0 reach0 impact
264 instances - 3 features - 2 classes - 0 missing values
Context Every year a lot of people migrate to different countries from Pakistan and a lot of them migrate to Pakistan as emigrants of refugees, Pakistan ranks 2nd, according to UNHCR, among the…
0 runs0 likes0 downloads0 reach0 impact
264 instances - 14 features - classes - 0 missing values
Touch Signals
0 runs0 likes0 downloads0 reach0 impact
265 instances - 11 features - classes - 0 missing values
Touch samples 2
0 runs0 likes0 downloads0 reach0 impact
265 instances - 11 features - 8 classes - 0 missing values
Context Teleport.org provides data related to quality of life for the most creative cities around the world, which enables users to find suitable cities for living and working according to their…
0 runs0 likes0 downloads0 reach0 impact
266 instances - 21 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100414, and it has 266 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
266 instances - 1026 features - 0 classes - 0 missing values
SPECT heart data This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. Sources: --…
1296 runs0 likes0 downloads0 reach0 impact
267 instances - 23 features - 2 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
65 runs0 likes0 downloads0 reach0 impact
267 instances - 10936 features - 2 classes - 0 missing values
This is a corrected version of the previous data file in version 1, which contained a dataset (349 instances) incorrectly merged from the original training and test sets available on UCI (there are…
0 runs0 likes0 downloads0 reach0 impact
267 instances - 45 features - 2 classes - 0 missing values
Consumer Price Indices (CPI) measure changes over time in general level of prices of goods and services that households acquire for the purpose of consumption. CPI numbers are widely used as a…
0 runs0 likes0 downloads0 reach0 impact
267 instances - 30 features - classes - 208 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 57, and it has 269 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
269 instances - 1026 features - 0 classes - 0 missing values
Context The goal of this dataset is to provide a tidy way to access to the transcripts of speeches given by various US politicians in the context of the 2020 US Presidential Election. Transcripts have…
0 runs0 likes0 downloads0 reach0 impact
269 instances - 6 features - classes - 42 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11907, and it has 270 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
270 instances - 1026 features - 0 classes - 0 missing values
This database contains 13 attributes (which have been extracted from a larger set of 75) Attribute Information: ------------------------ -- 1. age -- 2. sex -- 3. chest pain type (4 values) -- 4.…
3215 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - 2 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: scaled to [-1,1]
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17033, and it has 270 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
270 instances - 1026 features - 0 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values
Context: The leading cause of death in the developed world is heart disease. Therefore there needs to be work done to help prevent the risks of of having a heart attack or stroke. Content: Use this…
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 240, and it has 272 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
272 instances - 1026 features - 0 classes - 0 missing values
//Add the description.md of the data file PhosphoproteinChallenge_DREAM3 Human normal and cancer hepatocytes (cell line HepG2s) were treated with 7 stimuli (Table 1a) that are relevant to hepatocyte…
0 runs0 likes0 downloads0 reach0 impact
272 instances - 21 features - classes - 0 missing values
//Add the description.md of the data file PhosphoproteinChallenge_DREAM3 Human normal and cancer hepatocytes (cell line HepG2s) were treated with 7 stimuli (Table 1a) that are relevant to hepatocyte…
0 runs0 likes0 downloads0 reach0 impact
272 instances - 21 features - classes - 0 missing values
//Add the description.md of the data file PhosphoproteinChallenge_DREAM3 Human normal and cancer hepatocytes (cell line HepG2s) were treated with 7 stimuli (Table 1a) that are relevant to hepatocyte…
0 runs0 likes0 downloads0 reach0 impact
272 instances - 21 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20033, and it has 273 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
273 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10058, and it has 273 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
273 instances - 1026 features - 0 classes - 0 missing values
No data.
748 runs0 likes0 downloads0 reach0 impact
274 instances - 9 features - 2 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach0 impact
274 instances - 1143 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 220, and it has 274 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
274 instances - 1026 features - 0 classes - 0 missing values
GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…
77 runs0 likes0 downloads0 reach0 impact
275 instances - 10936 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101407, and it has 276 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
276 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17021, and it has 276 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
276 instances - 1026 features - 0 classes - 0 missing values
Citation Request: This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
66 runs0 likes0 downloads0 reach0 impact
277 instances - 10 features - 2 classes - 0 missing values
SK daily COVID19
0 runs0 likes0 downloads0 reach0 impact
280 instances - 7 features - classes - 0 missing values
This data was collected from combine primary and secondary sources, through questionnaire, verbal interview and some part of the hospital’s record department’s data, from the selected…
0 runs0 likes0 downloads0 reach0 impact
281 instances - 98 features - 2 classes - 2 missing values
Data from the RSCTC 2010 Discovery Challenge. All datasets contain between 100 and 400 samples, characterized by values of 20,000 - 65,000 attributes. Samples are assigned to several (2-10) classes.…
11 runs0 likes0 downloads0 reach0 impact
283 instances - 54622 features - 3 classes - 0 missing values
Data from the RSCTC 2010 Discovery Challenge. All datasets contain between 100 and 400 samples, characterized by values of 20,000 - 65,000 attributes. Samples are assigned to several (2-10) classes.…
9 runs0 likes0 downloads0 reach0 impact
283 instances - 54622 features - 3 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 243, and it has 283 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
283 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12023, and it has 283 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
283 instances - 1026 features - 0 classes - 0 missing values