OpenML
Filter results by:
Pulsar candidates collected during the HTRU survey. Pulsars are a type of star, of considerable scientific interest. Candidates must be classified in to pulsar and non-pulsar classes to aid discovery.…
0 runs0 likes0 downloads0 reach0 impact
17898 instances - 9 features - 2 classes - 0 missing values
diabetes
0 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - classes - 0 missing values
Hourly Interstate 94 Westbound traffic volume for MN DoT ATR station 301, roughly midway between Minneapolis and St Paul, MN. Hourly weather features and holidays included for impacts on traffic…
0 runs0 likes0 downloads0 reach0 impact
48204 instances - 9 features - classes - 0 missing values
Context The unprocessed dataset was acquired from UCI Machine Learning organisation. This dataset is preprocessed by me, originally from the National Institute of Diabetes and Digestive and Kidney…
0 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - 0 classes - 0 missing values
DESCRIPTION Problem Statement NIDDK (National Institute of Diabetes and Digestive and Kidney Diseases) research creates knowledge about and treatments for the most chronic, costly, and consequential…
0 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - 0 classes - 0 missing values
Context Well, what happened was that I was looking for a semi-definite easy-to-read list of international football matches and couldn't find anything decent. So I took it upon myself to collect it for…
0 runs0 likes0 downloads0 reach0 impact
41586 instances - 9 features - classes - 0 missing values
Context Well, what happened was that I was looking for a semi-definite easy-to-read list of international football matches and couldn't find anything decent. So I took it upon myself to collect it for…
0 runs0 likes0 downloads0 reach0 impact
41586 instances - 9 features - classes - 0 missing values
Modified version of subsampled dataset from Tencent Inc. on OpenML. Duplicate rows are dropped. Columns with a high ratio of unique values are dropped. Some columns are cast to factor.
0 runs0 likes0 downloads0 reach0 impact
39926 instances - 9 features - 2 classes - 0 missing values
Nursery Database was derived from a hierarchical decision model originally developed to rank applications for nursery schools.
0 runs0 likes0 downloads0 reach0 impact
12960 instances - 9 features - 5 classes - 0 missing values
This dataset was obtained from the UCI Repository. The dataset was created by Angeliki Xifara (angxifara '@' gmail.com, Civil/Structural Engineer) and was processed by Athanasios Tsanas…
0 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - 0 classes - 0 missing values
Subsampling of the dataset electricity (44156) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 9 features - 2 classes - 0 missing values
Subsampling of the dataset electricity (44156) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 9 features - 2 classes - 0 missing values
Subsampling of the dataset electricity (44156) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 9 features - 2 classes - 0 missing values
Subsampling of the dataset electricity (44156) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 9 features - 2 classes - 0 missing values
Subsampling of the dataset electricity (44156) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 9 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on both numerical and categorical…
0 runs0 likes0 downloads0 reach0 impact
4177 instances - 9 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
5465575 instances - 9 features - 0 classes - 0 missing values
test001
0 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - classes - 0 missing values
Context This data set aims to provide a start up for the ones who just started off with deep learning and act as a benchmark. Content The feature set includes: Cement Blast Furnace Slag Fly Ash Water…
0 runs0 likes0 downloads0 reach0 impact
1030 instances - 9 features - classes - 0 missing values
Interest and Motivation This dataset belongs to the MeToo movement on Twitter. This movement was against the sexual harassment incidents and many people posted various hatred tweets. Using this…
0 runs0 likes0 downloads0 reach0 impact
807174 instances - 9 features - classes - 197782 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on categorical and numerical features"…
0 runs0 likes0 downloads0 reach0 impact
38474 instances - 9 features - 2 classes - 0 missing values
Chocolate Bar Ratings. Expert ratings of over 1,700 chocolate bars. Each chocolate is evaluated from a combination of both objective qualities and subjective interpretation. A rating here only…
0 runs0 likes0 downloads0 reach0 impact
1795 instances - 9 features - 42 classes - 1 missing values
https://archive.ics.uci.edu/ml/datasets/Diabetes
0 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - classes - 0 missing values
Context Chocolate is one of the most popular candies in the world. Each year, residents of the United States collectively eat more than 2.8 billions pounds. However, not all chocolate bars are created…
0 runs0 likes0 downloads0 reach0 impact
1795 instances - 9 features - classes - 962 missing values
Context Chocolate is one of the most popular candies in the world. Each year, residents of the United States collectively eat more than 2.8 billions pounds. However, not all chocolate bars are created…
0 runs0 likes0 downloads0 reach0 impact
1795 instances - 9 features - classes - 962 missing values
Cryptocurrencies Cryptocurrencies are fast becoming rivals to traditional currency across the world. The digital currencies are available to purchase in many different places, making it accessible to…
0 runs0 likes0 downloads0 reach0 impact
632218 instances - 9 features - classes - 69712 missing values
CryptocurrenciesCryptocurrenciesarefastbecomingrivalstotraditionalcurrencyacrosstheworldThedigitalcurrenciesareavailabletopurchaseinmanydifferentplacesmakingitaccessibletoeveryoneandwithretailersacceptingvariouscryptocurrenciesitcouldbeasignthatmoneyasweknowitisabouttogothroughamajorchangeInadditiontheblockchaintechnologyonwhichmanycryptocurrenciesarebasedwithitsrevolutionarydistributeddigitalbackbonehasmanyotherpromisingapplicationsImplementationsofsecuredecentralizedsystemscanaidusinconqueringorganizationalissuesoftrustandsecuritythathaveplaguedoursocietythroughouttheagesIneffectwecanfundamentallydisruptindustriescoretoeconomiesbusinessesandsocialstructureseliminatinginefficiencyandhumanerrorContentThedatasetcontainsallhistoricaldailypricesopenhighlowcloseforallcryptocurrencieslistedonCoinMarketCapAcknowledgementsEveryCryptocurrencyDailyMarketPriceIinitiallydevelopedkernelsforthisdatasetbeforemakingmyownscraperanddatasetsothatIcouldkeepitregularlyupdatedCoinMarketCapForthedata…
0 runs0 likes0 downloads0 reach0 impact
632218 instances - 9 features - classes - 69712 missing values
CryptocurrenciesCryptocurrencies are fast becoming rivals to traditional currency across the world. The digital currencies are available to purchase in many different places, making it accessible to…
0 runs0 likes0 downloads0 reach0 impact
632218 instances - 9 features - classes - 69712 missing values
Context This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective is to predict based on diagnostic measurements whether a patient has…
0 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - 0 classes - 0 missing values
Description: Pulsars are a rare type of Neutron star that produce radio emission detectable here on Earth. They are of considerable scientific interest as probes of space-time, the interstellar…
0 runs0 likes0 downloads0 reach0 impact
17897 instances - 9 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 9 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on categorical and numerical features"…
0 runs0 likes0 downloads0 reach0 impact
38474 instances - 9 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 9 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 9 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
10692 instances - 9 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 9 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
20634 instances - 9 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
20634 instances - 9 features - 2 classes - 0 missing values
Binarized version of the California Housing Dataset This dataset was obtained from Luis Torgo's collection of regression datasets. It was binarized to serve as the original, unprocessed date for the…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 9 features - 2 classes - 0 missing values
1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of Primary Industry and Fisheries, Tasmania…
34899 runs0 likes0 downloads0 reach0 impact
4177 instances - 9 features - 28 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1187 runs0 likes0 downloads0 reach0 impact
412 instances - 9 features - 7 classes - 96 missing values
This is data set is concerned with the forward kinematics of an 8 link robot arm. Among the existing variants of this data set we have used the variant 8nm, which is known to be highly non-linear and…
23 runs0 likes0 downloads0 reach0 impact
8192 instances - 9 features - 0 classes - 0 missing values
A family of datasets synthetically generated from a simulation of how bank-customers choose their banks. Tasks are based on predicting the fraction of bank customers who leave the bank because of full…
0 runs0 likes0 downloads0 reach0 impact
8192 instances - 9 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
107 runs0 likes0 downloads0 reach0 impact
74 instances - 9 features - 2 classes - 0 missing values
No data.
747 runs0 likes0 downloads0 reach0 impact
369 instances - 9 features - 2 classes - 0 missing values
1. Title: Pima Indians Diabetes Database 2. Sources: (a) Original owners: National Institute of Diabetes and Digestive and Kidney Diseases (b) Donor of database: Vincent Sigillito…
203503 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - 2 classes - 0 missing values
1. Title: Postoperative Patient Data 2. Source Information: -- Creators: Sharon Summers, School of Nursing, University of Kansas Medical Center, Kansas City, KS 66160 Linda Woolery, School of Nursing,…
1758 runs0 likes0 downloads0 reach0 impact
90 instances - 9 features - 3 classes - 3 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
119 runs0 likes0 downloads0 reach0 impact
95 instances - 9 features - 2 classes - 9 missing values
Chocolate Bar Ratings. Expert ratings of over 1,700 chocolate bars. Each chocolate is evaluated from a combination of both objective qualities and subjective interpretation. A rating here only…
0 runs0 likes0 downloads0 reach0 impact
1794 instances - 9 features - 41 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
4324 instances - 9 features - classes - 3360 missing values
Make target (age) numeric**Author**: 1. Title of Database: Abalone data 2. Sources: (a) Original owners of database: Marine Resources Division Marine Research Laboratories - Taroona Department of…
1 runs0 likes0 downloads0 reach0 impact
4177 instances - 9 features - 0 classes - 0 missing values
Content The data includes the text, whether the tweet is a retweet, whether the tweet is deleted, and so much more. It is sorted by descending date (so the highest rows are from 2009 and the last rows…
0 runs0 likes0 downloads0 reach0 impact
56571 instances - 9 features - classes - 0 missing values
The similarities and differences in the behaviors of different people have long been of interest, particularly in psychology and other social science fields. Understanding human behavior in particular…
0 runs0 likes0 downloads0 reach0 impact
50000 instances - 9 features - classes - 63798 missing values
Content This is a huge dataset that contains every web series around the globe streaming right now at the date of the creation of the dataset. Inspiration This dataset can be used to answer the…
0 runs0 likes0 downloads0 reach0 impact
12353 instances - 9 features - classes - 9257 missing values
Context This dataset was prepared for the journal article entitled "On the prediction of critical heat flux using a physics-informed machine learning-aided framework" (doi:…
0 runs0 likes0 downloads0 reach0 impact
1865 instances - 9 features - classes - 0 missing values
Short Description Here you are a dataset containing the top 100 coins by their total volume across all markets during 9th January of 2021. The prices are in USD dollars. If you need a different or…
0 runs0 likes0 downloads0 reach0 impact
40100 instances - 9 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 9 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 9 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
10692 instances - 9 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
10692 instances - 9 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
10692 instances - 9 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 9 features - 0 classes - 0 missing values
Subsampling of the dataset california (44090) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 9 features - 2 classes - 0 missing values
Subsampling of the dataset california (44090) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 9 features - 2 classes - 0 missing values
Subsampling of the dataset california (44090) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 9 features - 2 classes - 0 missing values
Subsampling of the dataset california (44090) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 9 features - 2 classes - 0 missing values
Subsampling of the dataset california (44090) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 9 features - 2 classes - 0 missing values
postoperative-patient-data-pmlb
26 runs0 likes0 downloads0 reach0 impact
88 instances - 9 features - 2 classes - 0 missing values
Twenty two observations of the Dwarf planet Ceres as observed by Giueseppe Piazzi and published in the September edition of Monatlicher Correspondenz in 1801. These were the measurements used by Gauss…
0 runs0 likes0 downloads0 reach0 impact
22 instances - 9 features - classes - 17 missing values
Incident reports from the San Franciso Police Department between January 2003 and May 2018, provided by the City and County of San Francisco. The dataset was downloaded on 05.11.2018. from…
0 runs1 likes0 downloads1 reach1 impact
2215023 instances - 9 features - 2 classes - 0 missing values
See [https://github.com/slds-lmu/paper_2023_ci_for_ge](https://github.com/slds-lmu/paper_2023_ci_for_ge) for a description.
0 runs0 likes0 downloads0 reach0 impact
5100000 instances - 9 features - 2 classes - 0 missing values
Predicting the age of abalone from physical measurements. The age of abalone is determined by cutting the shell through the cone, staining it, and counting the number of rings through a microscope --…
0 runs0 likes0 downloads0 reach0 impact
4177 instances - 9 features - 0 classes - 0 missing values
Concrete is the most important material in civil engineering. The concrete compressive strength is a highly nonlinear function of age and ingredients. Each instance represents a description, different…
0 runs0 likes0 downloads0 reach0 impact
1030 instances - 9 features - 0 classes - 0 missing values
Information on the variables was collected using all the block groups in California from the 1990 Census. In this sample a block group on average includes 1425.5 individuals living in a geographically…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 9 features - 0 classes - 0 missing values
This dataset looked into assessing the heating load and cooling load requirements of buildings (that is, energy efficiency) as a function of building parameters. Energy analysis is performed using 12…
0 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - 0 classes - 0 missing values
No data.
2198 runs1 likes17 downloads18 reach10 impact
1484 instances - 9 features - 10 classes - 0 missing values
A realistic simulation of the forward dynamics of an 8 link all-revolute robot arm. The task in all datasets is to predict the distance of the end-effector from a target. The input are the angular…
0 runs0 likes0 downloads0 reach0 impact
8192 instances - 9 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach0 impact
15 instances - 10 features - 0 classes - 0 missing values
This file contains the data in "The MU284 Population" from Appendix B of the book "Model Assisted Survey Sampling" by Sarndal, Swensson and Wretman, published by Springer-Verlag, New York, 1992. The…
0 runs0 likes0 downloads0 reach0 impact
284 instances - 10 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
103 runs0 likes0 downloads0 reach0 impact
92 instances - 10 features - 2 classes - 0 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
16080 runs0 likes0 downloads0 reach0 impact
672 instances - 10 features - 2 classes - 1200 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach0 impact
15 instances - 10 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach0 impact
100 instances - 10 features - classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
18 runs0 likes0 downloads0 reach0 impact
159 instances - 10 features - 0 classes - 6 missing values
This dataset contains 3 more features compared to version 1 of the same dataset. Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by…
0 runs0 likes0 downloads0 reach0 impact
62 instances - 10 features - 0 classes - 38 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Survival treated as the class attribute As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
12 runs0 likes0 downloads0 reach0 impact
130 instances - 10 features - 0 classes - 97 missing values
This is a dataset obtained from the StatLib repository. Here is the included description: The data provided are daily stock prices from January 1988 through October 1991, for ten aerospace companies.…
5 runs0 likes0 downloads0 reach0 impact
950 instances - 10 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Tumor-size treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs0 likes0 downloads0 reach0 impact
286 instances - 10 features - 0 classes - 9 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
0 runs0 likes0 downloads0 reach0 impact
42 instances - 10 features - 0 classes - 0 missing values
File README ----------- chscase A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach0 impact
52 instances - 10 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
773 runs0 likes0 downloads0 reach0 impact
950 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
720 runs0 likes0 downloads0 reach0 impact
159 instances - 10 features - 2 classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
708 runs0 likes0 downloads0 reach0 impact
286 instances - 10 features - 2 classes - 9 missing values
* Source: JP Marques de Sá, INEB-Instituto de Engenharia Biomédica, Porto, Portugal; e-mail: jpmdesa '@' gmail.com J Jossinet, inserm, Lyon, France * Data Set Information: Impedance measurements…
280 runs0 likes0 downloads0 reach0 impact
106 instances - 10 features - 6 classes - 0 missing values
This database encodes the complete set of possible board configurations at the end of tic-tac-toe games, where "x" is assumed to have played first. The target concept is "win for x" (i.e., true when…
386788 runs0 likes0 downloads0 reach0 impact
958 instances - 10 features - 2 classes - 0 missing values
Citation Request: This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
2009 runs0 likes0 downloads0 reach0 impact
286 instances - 10 features - 2 classes - 9 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
728 runs0 likes0 downloads0 reach0 impact
52 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
767 runs0 likes0 downloads0 reach0 impact
189 instances - 10 features - 2 classes - 0 missing values
Datasets for `Pattern Recognition and Neural Networks' by B.D. Ripley ===================================================================== Cambridge University Press (1996) ISBN 0-521-46086-7 The…
640 runs0 likes0 downloads0 reach0 impact
214 instances - 10 features - 6 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
112 runs0 likes0 downloads0 reach0 impact
42 instances - 10 features - 2 classes - 0 missing values