OpenML
Filter results by:
Data have been normalized by using the Z-normalization method and divided into two data sets
0 runs0 likes0 downloads0 reach0 impact
20867 instances - 13 features - classes - 0 missing values
The goal of the research is to help the auditors by building a classification model that can predict the fraudulent firm on the basis the present and historical risk factors. The information about the…
0 runs0 likes0 downloads0 reach0 impact
1552 instances - 37 features - 0 classes - 19402 missing values
In the dataset there are 5 types of dataset.QCM3, QCM6, QCM7, QCM10, QCM12In each of dataset, there is alcohol classification of five types,1-octanol, 1-propanol, 2-butanol, 2-propanol, 1-isobutanolIn…
0 runs0 likes0 downloads0 reach0 impact
125 instances - 15 features - classes - 0 missing values
We aggregated screen movements into screen-fixations using a Salvucci & Goldberg (2000) dispersion-threshold algorithm, and defined Perception Action Cycles (PACs) as fixations with at least one…
0 runs0 likes0 downloads0 reach0 impact
3395 instances - 20 features - classes - 168 missing values
The energy dispersive X-ray fluorescence (EDXRF) was used to determine the chemical composition of celadon body and glaze in Longquan kiln (at Dayao County) and Jingdezhen kiln. Forty typical shards…
0 runs0 likes0 downloads0 reach0 impact
88 instances - 19 features - classes - 0 missing values
The dataset contains 19 attributes regarding ca cervix behavior risk with class label is ca_cervix with 1 and 0 as values which means the respondent with and without ca cervix, respectively. ###…
0 runs0 likes0 downloads0 reach0 impact
858 instances - 36 features - classes - 3622 missing values
The dataset was collected at 'Hospital Universitario de Caracas' in Caracas, Venezuela. The dataset comprises demographic information, habits, and historic medical records of 858 patients. Several…
0 runs0 likes0 downloads0 reach0 impact
858 instances - 36 features - classes - 3622 missing values
Sanitized and anonymized Cargo 2000 (C2K) airfreight tracking and tracing events, covering five months of business execution (3,942 process instances, 7,932 transport legs, 56,082 activities). ###…
0 runs0 likes0 downloads0 reach0 impact
3943 instances - 98 features - classes - 210284 missing values
will be updated
0 runs0 likes0 downloads0 reach0 impact
2111 instances - 17 features - classes - 0 missing values
It is a public set of comments collected for spam research. It has five datasets composed by 1,956 real messages extracted from five videos that were among the 10 most viewed on the collection period.…
0 runs0 likes0 downloads0 reach0 impact
370 instances - 5 features - classes - 0 missing values
It is a public set of comments collected for spam research. It has five datasets composed by 1,956 real messages extracted from five videos that were among the 10 most viewed on the collection period.…
0 runs0 likes0 downloads0 reach0 impact
448 instances - 5 features - classes - 245 missing values
It is a public set of comments collected for spam research. It has five datasets composed by 1,956 real messages extracted from five videos that were among the 10 most viewed on the collection period.…
0 runs0 likes0 downloads0 reach0 impact
438 instances - 5 features - classes - 0 missing values
It is a public set of comments collected for spam research. It has five datasets composed by 1,956 real messages extracted from five videos that were among the 10 most viewed on the collection period.…
0 runs0 likes0 downloads0 reach0 impact
350 instances - 5 features - classes - 0 missing values
It is a public set of comments collected for spam research. It has five datasets composed by 1,956 real messages extracted from five videos that were among the 10 most viewed on the collection period.…
0 runs0 likes0 downloads0 reach0 impact
350 instances - 5 features - classes - 0 missing values
This is a data set of Physicochemical Properties of Protein Tertiary Structure. The data set is taken from CASP 5-9. There are 45730 decoys and size varying from 0 to 21 armstrong. RMSD-Size of the…
0 runs0 likes0 downloads0 reach2 impact
45730 instances - 10 features - classes - 0 missing values
We choose age, delivery number, delivery time, blood pressure and heart status. We classify delivery time to Premature, Timely and Latecomer. As like the delivery time we consider blood pressure in…
0 runs0 likes0 downloads0 reach0 impact
80 instances - 6 features - classes - 0 missing values
There are 10 predictors, all quantitative, and a binary dependent variable, indicating the presence or absence of breast cancer. The predictors are anthropometric data and parameters which can be…
0 runs0 likes0 downloads0 reach0 impact
116 instances - 10 features - 0 classes - 0 missing values
One of the primary challenges in identifying the risks of the Burst Header Packet (BHP) flood attacks in Optical Burst Switching networks (OBS) is the scarcity of reliable historical data. ###…
0 runs0 likes0 downloads0 reach0 impact
1075 instances - 22 features - classes - 15 missing values
This data is for the purpose of bias correction of next-day maximum and minimum air temperatures forecast of the LDAPS model operated by the Korea Meteorological Administration over Seoul, South…
0 runs0 likes0 downloads0 reach0 impact
7752 instances - 25 features - classes - 1248 missing values
The database was created with records of behavior of the urban traffic of the city of Sao Paulo in Brazil from December 14, 2009 to December 18, 2009 (From Monday to Friday). Registered from 7:00 to…
0 runs0 likes0 downloads0 reach0 impact
135 instances - 18 features - classes - 0 missing values
Author: Francesca Grisoni, Claudia S. Neuhaus, Miyabi Hishinuma, Gisela Gabernet, Jan A. Hiss, - Masaaki Kotera, Gisbert Schneider Source:…
0 runs0 likes0 downloads0 reach0 impact
949 instances - 3 features - classes - 0 missing values
Author: Francesca Grisoni, Claudia S. Neuhaus, Miyabi Hishinuma, Gisela Gabernet, Jan A. Hiss, - Masaaki Kotera, Gisbert Schneider Source:…
0 runs0 likes0 downloads0 reach0 impact
901 instances - 3 features - classes - 0 missing values
This hourly data set contains the PM2.5 data of US Embassy in Beijing. Meanwhile, meteorological data from Beijing Capital International Airport are also included.
0 runs0 likes0 downloads0 reach0 impact
43824 instances - 13 features - classes - 2067 missing values
The AI4I 2020 Predictive Maintenance Dataset is a synthetic dataset that reflects real predictive maintenance data encountered in industry. Since real predictive maintenance datasets are generally…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 14 features - classes - 0 missing values
We scraped a large number of eBay auctions of a popular product. After preprocessing the auction data, we build the SB dataset. The goal is to share the labelled SB dataset with the researchers.
0 runs0 likes0 downloads0 reach0 impact
6321 instances - 13 features - 0 classes - 0 missing values
The database was created with records of absenteeism at work from July 2007 to July 2010 at a courier company in Brazil. The data set allows for several new combinations of attributes and attribute…
0 runs0 likes0 downloads0 reach0 impact
740 instances - 21 features - classes - 0 missing values
dgf_test
0 runs0 likes0 downloads0 reach0 impact
3415 instances - 5 features - 2 classes - 1 missing values
dgf_test
0 runs0 likes0 downloads0 reach0 impact
3415 instances - 5 features - 2 classes - 1 missing values
1987 National Indonesia Contraceptive Prevalence Survey
0 runs0 likes0 downloads0 reach0 impact
1473 instances - 10 features - classes - 0 missing values
The dataset is about bankruptcy prediction of Polish companies. The data was collected from Emerging Markets Information Service (EMIS, [Web Link]), which is a database containing information on…
0 runs0 likes0 downloads0 reach0 impact
7027 instances - 65 features - classes - 5835 missing values
Autistic Spectrum Disorder (ASD) is a neurodevelopment condition associated with significant healthcare costs, and early diagnosis can significantly reduce these. Unfortunately, waiting times for an…
0 runs0 likes0 downloads0 reach0 impact
704 instances - 21 features - classes - 192 missing values
This dataset contains 206 attributes of 70 children with physical and motor disability based on ICF-CY. In particular, the SCADI dataset is the only one that has been used by ML researchers for…
0 runs0 likes0 downloads0 reach0 impact
70 instances - 206 features - classes - 0 missing values
This dataset describes 100,000 realistic, synthetically generated worker compensation insurance claims. Along the ultimate financial losses, each claim is described by the initial case estimate, date…
0 runs0 likes0 downloads0 reach0 impact
100000 instances - 14 features - 0 classes - 0 missing values
Laboratorio_dataset_car
0 runs0 likes0 downloads0 reach0 impact
1750 instances - 7 features - classes - 0 missing values
Laboratory dataset
0 runs0 likes0 downloads0 reach0 impact
1750 instances - 7 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
178 instances - 16 features - classes - 0 missing values
#### Information A small classic dataset from Fisher, 1936. One of the earliest datasets used for the evaluation of classification methodologies. #### References * Fisher, R. A. (1936), The use of…
0 runs0 likes1 downloads1 reach0 impact
150 instances - 7 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1484 instances - 18 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
194 instances - 32 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
178 instances - 16 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
528 instances - 21 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
528 instances - 21 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
846 instances - 22 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
846 instances - 22 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
950 instances - 10 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 35 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2310 instances - 25 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2310 instances - 25 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
6435 instances - 42 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
10992 instances - 26 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
10992 instances - 26 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
360 instances - 105 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 42 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
150 instances - 7 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
506 instances - 12 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 30 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
214 instances - 15 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
214 instances - 15 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
40768 instances - 14 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
16599 instances - 18 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
336 instances - 15 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 28 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 31 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
8192 instances - 11 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 28 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 8 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
106 instances - 15 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
252 instances - 14 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
5472 instances - 15 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
841 instances - 74 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
841 instances - 74 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
8378 instances - 123 features - classes - 18372 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
37 instances - 19 features - classes - 0 missing values
No data.
4 runs0 likes0 downloads0 reach0 impact
45918 instances - 22 features - 0 classes - 0 missing values
No data.
7 runs0 likes0 downloads0 reach0 impact
45918 instances - 22 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
7349 instances - 49153 features - 37 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
7349 instances - 49153 features - 37 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
41188 instances - 21 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 27649 features - 2 classes - 0 missing values
No data.
2 runs0 likes0 downloads0 reach0 impact
16000 instances - 27649 features - 2 classes - 0 missing values
No data.
7 runs0 likes0 downloads0 reach0 impact
4000 instances - 27649 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 27649 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
26 instances - 5 features - classes - 0 missing values
https://www.kaggle.com/dansbecker/nba-shot-logs
0 runs0 likes0 downloads0 reach0 impact
128069 instances - 21 features - classes - 5567 missing values
Survey to know if people self-identify as Midwesterners.
0 runs0 likes0 downloads0 reach0 impact
2494 instances - 28 features - 9 classes - 99 missing values
Survey to know if people self-identify as Midwesterners.
0 runs0 likes0 downloads0 reach0 impact
2778 instances - 28 features - 10 classes - 1737 missing values
Data reported to the police about the circumstances of personal injury road accidents in Great Britain from 1979, and the maker and model information of vehicles involved in the respective accident.…
0 runs0 likes1 downloads1 reach1 impact
363243 instances - 67 features - 3 classes - 2181757 missing values
PM 2.5 datasetd
0 runs0 likes0 downloads0 reach0 impact
43800 instances - 10 features - classes - 0 missing values
#test data for mlp
0 runs0 likes0 downloads0 reach0 impact
200 instances - 12 features - classes - 0 missing values
tesl dataset about L
0 runs0 likes0 downloads0 reach0 impact
150000 instances - 8 features - classes - 0 missing values
service data
0 runs0 likes0 downloads0 reach0 impact
34 instances - 8 features - classes - 0 missing values
testing
0 runs0 likes0 downloads0 reach0 impact
3279 instances - 1559 features - classes - 0 missing values
Outliers data set extracted from the Illustration (Fig. 3) in "Novelty detection with application to data streams"
0 runs0 likes0 downloads0 reach0 impact
75 instances - 3 features - 4 classes - 0 missing values
50 Danish words with their pronunciation from Dansk Ordbog
0 runs0 likes0 downloads0 reach0 impact
51 instances - 2 features - classes - 2 missing values
This is the full version of the KDD Cup 2009 dataset Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs0 likes0 downloads0 reach0 impact
50000 instances - 15001 features - 2 classes - 14616450 missing values
This is the full version of the KDD Cup 2009 dataset Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs0 likes0 downloads0 reach0 impact
50000 instances - 15001 features - 2 classes - 14616450 missing values
This is the full version of the KDD Cup 2009 dataset Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs0 likes0 downloads0 reach0 impact
50000 instances - 15001 features - 2 classes - 14616450 missing values
SK daily COVID19
0 runs0 likes0 downloads0 reach0 impact
280 instances - 7 features - classes - 0 missing values
This is a smaller version of the original dataset, containing 1M rows. ### Attribute Information * The first column is the class label (1 for signal, 0 for background) * 21 low-level features…
0 runs0 likes0 downloads0 reach2 impact
1000000 instances - 29 features - 2 classes - 0 missing values
Survey to know if people self-identify as Midwesterners.
0 runs0 likes0 downloads0 reach0 impact
2778 instances - 28 features - 10 classes - 1737 missing values