OpenML

fri_c3_250_25 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 0 classes - 0 missing values

fri_c0_250_5 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

853 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 2 classes - 0 missing values

fri_c0_250_25 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

746 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 2 classes - 0 missing values

fri_c0_250_10 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

773 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 2 classes - 0 missing values

fri_c1_250_50 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

755 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 2 classes - 0 missing values

fri_c4_250_50 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

817 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 2 classes - 0 missing values

fri_c2_250_5 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

847 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 2 classes - 0 missing values

AP_Lung_Uterus (1)

GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…

77 runs0 likes0 downloads0 reach0 impact
250 instances - 10936 features - 2 classes - 0 missing values

fri_c2_250_25 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 0 classes - 0 missing values

fri_c1_250_5 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 0 classes - 0 missing values

fri_c2_250_5 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 0 classes - 0 missing values

fri_c3_250_10 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 0 classes - 0 missing values

fri_c0_250_50 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 0 classes - 0 missing values

fri_c1_250_5 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

854 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 2 classes - 0 missing values

fri_c2_250_10 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

782 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 2 classes - 0 missing values

fri_c4_250_25 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

806 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 2 classes - 0 missing values

fri_c1_250_10 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

791 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 2 classes - 0 missing values

qualitative-bankruptcy (1)

* Abstract: Predict the Bankruptcy from Qualitative parameters from experts. * Source: Source Information -- Creator : Mr.A.Martin(jayamartin '@' yahoo.com) Mr.J.Uthayakumar (uthayakumar17691 '@'…

147 runs0 likes0 downloads0 reach0 impact
250 instances - 7 features - 2 classes - 0 missing values

Reliance-Stock-DataJul-2019---Jul-2020 (1)

Context This Data set Contains the values of Stock of Amazon which dates between 15 July ,2019 to 15 July,2020 Content The Data set contains 7 different columns that includes Date, The opening value…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 8 features - classes - 23 missing values

fri_c0_250_5 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 0 classes - 0 missing values

fri_c4_250_100 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 101 features - 0 classes - 0 missing values

fri_c3_250_5 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 0 classes - 0 missing values

fri_c1_250_25 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 0 classes - 0 missing values

fri_c4_250_10 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 0 classes - 0 missing values

fri_c4_250_50 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 0 classes - 0 missing values

fri_c3_250_5 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

822 runs0 likes0 downloads0 reach0 impact
250 instances - 6 features - 2 classes - 0 missing values

fri_c0_250_50 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

748 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 2 classes - 0 missing values

fri_c1_250_25 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

764 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 2 classes - 0 missing values

fri_c2_250_50 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

786 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 2 classes - 0 missing values

fri_c4_250_10 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

773 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 2 classes - 0 missing values

fri_c3_250_50 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

775 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 2 classes - 0 missing values

fri_c0_250_25 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 0 classes - 0 missing values

fri_c1_250_10 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 0 classes - 0 missing values

fri_c3_250_50 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 0 classes - 0 missing values

fri_c0_250_10 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 0 classes - 0 missing values

fri_c1_250_50 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 0 classes - 0 missing values

fri_c2_250_50 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 51 features - 0 classes - 0 missing values

fri_c4_250_25 (1)

The Friedman datasets are 80 artificially generated datasets originating from: J.H. Friedman (1999). Stochastic Gradient Boosting The dataset names are coded as…

0 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 0 classes - 0 missing values

fri_c3_250_10 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

791 runs0 likes0 downloads0 reach0 impact
250 instances - 11 features - 2 classes - 0 missing values

fri_c2_250_25 (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

812 runs0 likes0 downloads0 reach0 impact
250 instances - 26 features - 2 classes - 0 missing values

QSAR-TID-175 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 175, and it has 251 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
251 instances - 1026 features - 0 classes - 0 missing values

bodyfat (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

769 runs0 likes0 downloads0 reach0 impact
252 instances - 15 features - 2 classes - 0 missing values

bodyfat (3)

No data.

0 runs0 likes0 downloads0 reach0 impact
252 instances - 14 features - classes - 0 missing values

bodyfat (1)

Short Summary: Lists estimates of the percentage of body fat determined by underwater weighing and various body circumference measurements for 252 men. Classroom use of this data set: This data set…

54 runs0 likes0 downloads0 reach0 impact
252 instances - 15 features - 0 classes - 0 missing values

MeanWhile1 (1)

Mean While 1

0 runs0 likes0 downloads0 reach0 impact
253 instances - 38 features - 2 classes - 0 missing values

MegaWatt1 (1)

Mega watt

183 runs0 likes0 downloads0 reach0 impact
253 instances - 38 features - 2 classes - 0 missing values

Ovarian (1)

No data.

0 runs0 likes0 downloads0 reach0 impact
253 instances - 15155 features - 2 classes - 0 missing values

COVID-19-biotech-companies-on-stock-exchange(2020) (1)

The coronavirus pandemic has affected the entire world and many families have been destroyed. The stock exchange was also affected, but vaccine companies took advantage of this moment and leveraged…

0 runs0 likes0 downloads0 reach0 impact
255 instances - 36 features - classes - 35 missing values

QSAR-TID-20158 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20158, and it has 257 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
257 instances - 1026 features - 0 classes - 0 missing values

AP_Endometrium_Ovary (1)

GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…

66 runs0 likes0 downloads0 reach0 impact
259 instances - 10936 features - 2 classes - 0 missing values

GDP-per-capita-all-countries (1)

Gross Domestic Product (GDP) is the monetary value of all finished goods and services made within a country during a specific period. GDP provides an economic snapshot of a country, used to estimate…

0 runs0 likes0 downloads0 reach0 impact
260 instances - 32 features - classes - 1074 missing values

movies (2)

This dataset includes demographic data for users who have rated the top 15 most-rated movies, ranked based on a star rating system.

0 runs0 likes0 downloads0 reach0 impact
260 instances - 79 features - classes - 0 missing values

movies (3)

This dataset includes demographic data for users who have rated the top 15 most-rated movies, ranked based on a star rating system.

0 runs0 likes0 downloads0 reach0 impact
260 instances - 79 features - classes - 0 missing values

Gender-discrimination (1)

Context Content A few years ago, the United States District Court of Houston had a case that arises under Title VII of the Civil Rights Act of 1964, 42 U.S.C. 200e et seq. The plaintiffs in this case…

0 runs0 likes0 downloads0 reach0 impact
261 instances - 10 features - classes - 0 missing values

QSAR-TID-10981 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10981, and it has 262 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
262 instances - 1026 features - 0 classes - 0 missing values

QSAR-TID-10679 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10679, and it has 262 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
262 instances - 1026 features - 0 classes - 0 missing values

gnfuv-pi-3 (1)

test

0 runs0 likes0 downloads0 reach0 impact
263 instances - 5 features - classes - 0 missing values

analcatdata_lawsuit (1)

analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…

886 runs0 likes0 downloads0 reach0 impact
264 instances - 5 features - 2 classes - 0 missing values

rmftsa_ctoarrivals (1)

Data Sets for 'Regression Models for Time Series Analysis' by B. Kedem and K. Fokianos, Wiley 2002. Submitted by Kostas Fokianos (fokianos@ucy.ac.cy) [8/Nov/02] (176k) Note: - attribute names were…

2 runs0 likes0 downloads0 reach0 impact
264 instances - 3 features - 0 classes - 0 missing values

rmftsa_ctoarrivals (2)

Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…

1059 runs0 likes0 downloads0 reach0 impact
264 instances - 3 features - 2 classes - 0 missing values

Pakistans-Migration-History (1)

Context Every year a lot of people migrate to different countries from Pakistan and a lot of them migrate to Pakistan as emigrants of refugees, Pakistan ranks 2nd, according to UNHCR, among the…

0 runs0 likes0 downloads0 reach0 impact
264 instances - 14 features - classes - 0 missing values

Touch (1)

Touch Signals

0 runs0 likes0 downloads0 reach0 impact
265 instances - 11 features - classes - 0 missing values

Touch2 (1)

Touch samples 2

0 runs0 likes0 downloads0 reach0 impact
265 instances - 11 features - 8 classes - 0 missing values

City-Quality-of-Life-Dataset (1)

Context Teleport.org provides data related to quality of life for the most creative cities around the world, which enables users to find suitable cities for living and working according to their…

0 runs0 likes0 downloads0 reach0 impact
266 instances - 21 features - classes - 0 missing values

QSAR-TID-100414 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100414, and it has 266 rows and 1026 features…

1 runs0 likes0 downloads0 reach0 impact
266 instances - 1026 features - 0 classes - 0 missing values

SPECT (1)

SPECT heart data This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. Sources: --…

1296 runs0 likes0 downloads0 reach0 impact
267 instances - 23 features - 2 classes - 0 missing values

AP_Prostate_Ovary (1)

GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…

65 runs0 likes0 downloads0 reach0 impact
267 instances - 10936 features - 2 classes - 0 missing values

SPECTF (2)

This is a corrected version of the previous data file in version 1, which contained a dataset (349 instances) incorrectly merged from the original training and test sets available on UCI (there are…

0 runs0 likes0 downloads0 reach0 impact
267 instances - 45 features - 2 classes - 0 missing values

Consumer-Price-Index (1)

Consumer Price Indices (CPI) measure changes over time in general level of prices of goods and services that households acquire for the purpose of consumption. CPI numbers are widely used as a…

0 runs0 likes0 downloads0 reach0 impact
267 instances - 30 features - classes - 208 missing values

QSAR-TID-57 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 57, and it has 269 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
269 instances - 1026 features - 0 classes - 0 missing values

US-2020-Presidential-Election-Speeches (1)

Context The goal of this dataset is to provide a tidy way to access to the transcripts of speeches given by various US politicians in the context of the 2020 US Presidential Election. Transcripts have…

0 runs0 likes0 downloads0 reach0 impact
269 instances - 6 features - classes - 42 missing values

QSAR-TID-11907 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11907, and it has 270 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
270 instances - 1026 features - 0 classes - 0 missing values

heart-statlog (1)

This database contains 13 attributes (which have been extracted from a larger set of 75) Attribute Information: ------------------------ -- 1. age -- 2. sex -- 3. chest pain type (4 values) -- 4.…

3215 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - 2 classes - 0 missing values

heart (1)

libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: scaled to [-1,1]

0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - 0 classes - 0 missing values

QSAR-TID-17033 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17033, and it has 270 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
270 instances - 1026 features - 0 classes - 0 missing values

dataset_time_1 (1)

test

0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values

dataset_53_heart-statlog (1)

test

0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values

dataset_time (1)

test

0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values

dataset_time (2)

test

0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values

dataset_time (3)

test

0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values

Heart-Disease-Prediction (1)

Context: The leading cause of death in the developed world is heart disease. Therefore there needs to be work done to help prevent the risks of of having a heart attack or stroke. Content: Use this…

0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values

QSAR-TID-240 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 240, and it has 272 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
272 instances - 1026 features - 0 classes - 0 missing values

PhosphoproteinChallenge_DREAM3 (2)

//Add the description.md of the data file PhosphoproteinChallenge_DREAM3 Human normal and cancer hepatocytes (cell line HepG2s) were treated with 7 stimuli (Table 1a) that are relevant to hepatocyte…

0 runs0 likes0 downloads0 reach0 impact
272 instances - 21 features - classes - 0 missing values

PhosphoproteinChallenge_DREAM3 (1)

//Add the description.md of the data file PhosphoproteinChallenge_DREAM3 Human normal and cancer hepatocytes (cell line HepG2s) were treated with 7 stimuli (Table 1a) that are relevant to hepatocyte…

0 runs0 likes0 downloads0 reach0 impact
272 instances - 21 features - classes - 0 missing values

PhosphoproteinChallenge_DREAM3 (3)

//Add the description.md of the data file PhosphoproteinChallenge_DREAM3 Human normal and cancer hepatocytes (cell line HepG2s) were treated with 7 stimuli (Table 1a) that are relevant to hepatocyte…

0 runs0 likes0 downloads0 reach0 impact
272 instances - 21 features - classes - 0 missing values

QSAR-TID-20033 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20033, and it has 273 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
273 instances - 1026 features - 0 classes - 0 missing values

QSAR-TID-10058 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10058, and it has 273 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
273 instances - 1026 features - 0 classes - 0 missing values

jEdit_4.0_4.2 (1)

No data.

748 runs0 likes0 downloads0 reach0 impact
274 instances - 9 features - 2 classes - 0 missing values

mtp2 (1)

This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…

0 runs0 likes0 downloads0 reach0 impact
274 instances - 1143 features - 0 classes - 0 missing values

QSAR-TID-220 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 220, and it has 274 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
274 instances - 1026 features - 0 classes - 0 missing values

AP_Omentum_Ovary (1)

GEMLeR provides a collection of gene expression datasets that can be used for benchmarking gene expression oriented machine learning algorithms. They can be used for estimation of different quality…

77 runs0 likes0 downloads0 reach0 impact
275 instances - 10936 features - 2 classes - 0 missing values

QSAR-TID-101407 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101407, and it has 276 rows and 1026 features…

1 runs0 likes0 downloads0 reach0 impact
276 instances - 1026 features - 0 classes - 0 missing values

QSAR-TID-17021 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17021, and it has 276 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
276 instances - 1026 features - 0 classes - 0 missing values

breast-cancer-dropped-missing-attributes-values (1)

Citation Request: This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…

66 runs0 likes0 downloads0 reach0 impact
277 instances - 10 features - 2 classes - 0 missing values

SKdailyCOVID19 (1)

SK daily COVID19

0 runs0 likes0 downloads0 reach0 impact
280 instances - 7 features - classes - 0 missing values

DiabeticMellitus (1)

This data was collected from combine primary and secondary sources, through questionnaire, verbal interview and some part of the hospital’s record department’s data, from the selected…

0 runs0 likes0 downloads0 reach0 impact
281 instances - 98 features - 2 classes - 2 missing values

ovarianTumour (1)

Data from the RSCTC 2010 Discovery Challenge. All datasets contain between 100 and 400 samples, characterized by values of 20,000 - 65,000 attributes. Samples are assigned to several (2-10) classes.…

11 runs0 likes0 downloads0 reach0 impact
283 instances - 54622 features - 3 classes - 0 missing values

hepatitisC (1)

Data from the RSCTC 2010 Discovery Challenge. All datasets contain between 100 and 400 samples, characterized by values of 20,000 - 65,000 attributes. Samples are assigned to several (2-10) classes.…

9 runs0 likes0 downloads0 reach0 impact
283 instances - 54622 features - 3 classes - 0 missing values

QSAR-TID-243 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 243, and it has 283 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
283 instances - 1026 features - 0 classes - 0 missing values

QSAR-TID-12023 (1)

This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12023, and it has 283 rows and 1026 features (including…

1 runs0 likes0 downloads0 reach0 impact
283 instances - 1026 features - 0 classes - 0 missing values

Sign in

Filter results by: