OpenML
This data approaches student achievement in secondary education at two Portuguese schools. The data attributes include student grades and demographic, social, and school-related features, and it was…
0 runs0 likes0 downloads0 reach0 impact
395 instances - 33 features - 0 classes - 0 missing values
Conventional and Social Media Movies (CSM) - Dataset 2014 and 2015 Data Set 12 features categorized as conventional and social media features. Both conventional features, collected from movies…
0 runs0 likes0 downloads0 reach0 impact
232 instances - 14 features - classes - 60 missing values
Daily electric energy dataset The dee problem involves predicting the daily average price of TkWhe electricity energy in Spain. The data set contains real values from 2003 about the daily consumption…
0 runs0 likes0 downloads0 reach0 impact
365 instances - 7 features - 0 classes - 0 missing values
Electrical Length data set This problem with only two input variables involves a small search space (small complexity). However, it is still an interesting problem since the system is strongly…
0 runs0 likes0 downloads0 reach0 impact
495 instances - 3 features - 0 classes - 0 missing values
The data was taken from sources such as data.gov and various other websites and has been preprocessed for analysis.
0 runs0 likes0 downloads0 reach0 impact
204 instances - 5 features - classes - 0 missing values
Global soil saturated hydraulic conductivity measurements for geoscience applications. Total of 1,832 sites with 13,072 Ksat measurements were assembled from published literature and other sources and…
0 runs0 likes0 downloads0 reach0 impact
Global soil hydraulic properties (Ksat, Water Content 33 kPa <2mm, Water Content 1500 kPa <2mm) for geoscience applications. Total of 155,649 measurements were assembled from published…
0 runs0 likes0 downloads0 reach0 impact
tmm hjghjg vjgkjhbb nvhjgb
0 runs0 likes0 downloads0 reach0 impact
748 instances - 5 features - classes - 0 missing values
good
0 runs0 likes0 downloads0 reach0 impact
10 instances - 4 features - classes - 2 missing values
iris-example
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 3 classes - 0 missing values
TEST STUDENTS DATA
0 runs0 likes0 downloads0 reach0 impact
100 instances - 11 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
528 instances - 21 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
528 instances - 21 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
178 instances - 16 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
360 instances - 105 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
10992 instances - 26 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
10992 instances - 26 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 35 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
950 instances - 10 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
846 instances - 22 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
846 instances - 22 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
506 instances - 12 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
150 instances - 7 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
20000 instances - 42 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 28 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
16599 instances - 18 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
40768 instances - 14 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
6435 instances - 42 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2310 instances - 25 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2310 instances - 25 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
214 instances - 15 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
214 instances - 15 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 30 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
106 instances - 15 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 8 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 28 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
8192 instances - 11 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
2465 instances - 31 features - classes - 0 missing values
Data have been normalized by using the Z-normalization method and divided into two data sets
0 runs0 likes0 downloads0 reach0 impact
20867 instances - 13 features - classes - 0 missing values
This data set includes hourly air pollutant data from 12 nationally-controlled air-quality monitoring sites.
0 runs0 likes0 downloads0 reach0 impact
420768 instances - 18 features - classes - 74027 missing values
U. S. Department of Commerce, Bureau of the Census, Census Of Population And Housing 1990 United States: Summary Tape File 1a & 3a (Computer Files), U.S. Department Of Commerce, Bureau Of The Census…
0 runs0 likes0 downloads0 reach0 impact
2215 instances - 147 features - classes - 44592 missing values
There are 10 predictors, all quantitative, and a binary dependent variable, indicating the presence or absence of breast cancer. The predictors are anthropometric data and parameters which can be…
0 runs0 likes0 downloads0 reach0 impact
116 instances - 10 features - 0 classes - 0 missing values
We chose age, delivery number, delivery time, blood pressure and heart status. We classify delivery time as Premature, Timely, or Latecomer. As with delivery time, we consider blood pressure in…
0 runs0 likes0 downloads0 reach0 impact
80 instances - 6 features - classes - 0 missing values
This is a data set of Physicochemical Properties of Protein Tertiary Structure. The data set is taken from CASP 5-9. There are 45730 decoys, with sizes varying from 0 to 21 angstroms. RMSD-Size of the…
0 runs0 likes0 downloads0 reach2 impact
45730 instances - 10 features - classes - 0 missing values
It is a public set of comments collected for spam research. It has five datasets composed of 1,956 real messages extracted from five videos that were among the 10 most viewed during the collection period.…
0 runs0 likes0 downloads0 reach0 impact
350 instances - 5 features - classes - 0 missing values
It is a public set of comments collected for spam research. It has five datasets composed of 1,956 real messages extracted from five videos that were among the 10 most viewed during the collection period.…
0 runs0 likes0 downloads0 reach0 impact
370 instances - 5 features - classes - 0 missing values
Sanitized and anonymized Cargo 2000 (C2K) airfreight tracking and tracing events, covering five months of business execution (3,942 process instances, 7,932 transport legs, 56,082 activities). ###…
0 runs0 likes0 downloads0 reach0 impact
3943 instances - 98 features - classes - 210284 missing values
The dataset was collected at 'Hospital Universitario de Caracas' in Caracas, Venezuela. The dataset comprises demographic information, habits, and historic medical records of 858 patients. Several…
0 runs0 likes0 downloads0 reach0 impact
858 instances - 36 features - classes - 3622 missing values
The dataset contains 19 attributes regarding ca cervix behavior risk. The class label, ca_cervix, takes the values 1 and 0, meaning the respondent has or does not have ca cervix, respectively. ###…
0 runs0 likes0 downloads0 reach0 impact
858 instances - 36 features - classes - 3622 missing values
Energy dispersive X-ray fluorescence (EDXRF) was used to determine the chemical composition of celadon body and glaze in Longquan kiln (at Dayao County) and Jingdezhen kiln. Forty typical shards…
0 runs0 likes0 downloads0 reach0 impact
88 instances - 19 features - classes - 0 missing values
This is part of a collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
76 instances - 42 features - classes - 574 missing values
This is part of a collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
127 instances - 42 features - classes - 722 missing values
This is part of a collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
125 instances - 42 features - classes - 362 missing values
This is part of a collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
122 instances - 42 features - classes - 906 missing values
This is part of a collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
114 instances - 42 features - classes - 562 missing values
This is part of a collection of 8 files containing the match statistics for both women and men at the four major tennis tournaments of the year 2013. Each file has 42 columns and a minimum of 76 rows.…
0 runs0 likes0 downloads0 reach0 impact
126 instances - 42 features - classes - 978 missing values
We aggregated screen movements into screen-fixations using a Salvucci & Goldberg (2000) dispersion-threshold algorithm, and defined Perception Action Cycles (PACs) as fixations with at least one…
0 runs0 likes0 downloads0 reach0 impact
3395 instances - 20 features - classes - 168 missing values
The collection contains five datasets: QCM3, QCM6, QCM7, QCM10, and QCM12. Each dataset classifies five types of alcohol: 1-octanol, 1-propanol, 2-butanol, 2-propanol, and 1-isobutanol. In…
0 runs0 likes0 downloads0 reach0 impact
125 instances - 15 features - classes - 0 missing values
The goal of the research is to help auditors by building a classification model that can predict fraudulent firms on the basis of present and historical risk factors. The information about the…
0 runs0 likes0 downloads0 reach0 impact
1552 instances - 37 features - 0 classes - 19402 missing values
The OCLAR researchers, Marwan et al. (2019), gathered Arabic customer reviews from and Zomato website on a wide scope of domains, including restaurants, hotels, hospitals, local shops, etc. The…
0 runs0 likes0 downloads0 reach0 impact
3916 instances - 3 features - classes - 0 missing values
This data set measures the running time of a matrix-matrix product A x B = C, where all matrices have size 2048 x 2048, using a parameterizable SGEMM GPU kernel with 241600 possible parameter…
0 runs0 likes0 downloads0 reach0 impact
241600 instances - 18 features - classes - 0 missing values
It is a public set of comments collected for spam research. It has five datasets composed of 1,956 real messages extracted from five videos that were among the 10 most viewed during the collection period.…
0 runs0 likes0 downloads0 reach0 impact
350 instances - 5 features - classes - 0 missing values
It is a public set of comments collected for spam research. It has five datasets composed of 1,956 real messages extracted from five videos that were among the 10 most viewed during the collection period.…
0 runs0 likes0 downloads0 reach0 impact
438 instances - 5 features - classes - 0 missing values
It is a public set of comments collected for spam research. It has five datasets composed of 1,956 real messages extracted from five videos that were among the 10 most viewed during the collection period.…
0 runs0 likes0 downloads0 reach0 impact
448 instances - 5 features - classes - 245 missing values
Data reported to the police about the circumstances of personal injury road accidents in Great Britain from 1979, and the make and model information of the vehicles involved in each accident
0 runs0 likes0 downloads0 reach0 impact
363206 instances - 66 features - 0 classes - 876555 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1090 instances - 147 features - 0 classes - 0 missing values
source: http://plato.asu.edu/ftp/solvable.html authors: Rolf-David Bergdoll PAR10 performances of modern solvers on the solvable instances of MIPLIB2010. http://miplib.zib.de/ The algorithm runtime…
0 runs0 likes0 downloads0 reach0 impact
1090 instances - 145 features - 0 classes - 0 missing values
This collection includes data sets of one-dimensional ultrasound raw RF data (A-Scans) acquired from the biceps brachii muscles of 21 healthy volunteers. The annotation was performed by labeling the…
0 runs0 likes0 downloads0 reach0 impact
347 instances - 8 features - classes - 0 missing values
This collection includes data sets of one-dimensional ultrasound raw RF data (A-Scans) acquired from the biceps brachii muscles of a single healthy volunteer. The annotation was performed by labeling…
0 runs0 likes0 downloads0 reach0 impact
318 instances - 8 features - classes - 0 missing values
ARFF Training Data
0 runs0 likes0 downloads0 reach0 impact
177640 instances - 40 features - classes - 0 missing values
This is an experimental data set for trying to classify numbers in a lottery as "Highly likely to be picked" or "Not very likely to be picked". It is based on a little more than a…
0 runs0 likes0 downloads0 reach0 impact
12528 instances - 36 features - classes - 0 missing values
Data reported to the police about the circumstances of personal injury road accidents in Great Britain from 1979, and the make and model information of the vehicles involved in each accident
0 runs0 likes0 downloads0 reach0 impact
This dataset combines records from the MLCQ dataset with metrics extracted using the PMD Tool and the Understand tool, to determine whether a file contains code smells. Please note that the records…
0 runs0 likes0 downloads0 reach0 impact
86467 instances - 67 features - 0 classes - 2852906 missing values
artificial with anomaly
0 runs0 likes0 downloads0 reach0 impact
4032 instances - 3 features - classes - 0 missing values
asdfasd
0 runs0 likes0 downloads0 reach0 impact
761 instances - 42 features - classes - 630 missing values
artificial with anomaly
0 runs0 likes0 downloads0 reach0 impact
4032 instances - 3 features - classes - 0 missing values
This dataset combines records from the MLCQ dataset with metrics extracted using the PMD Tool and the Understand tool, to determine whether a file contains code smells. Please note that the records…
0 runs0 likes0 downloads0 reach0 impact
83943 instances - 67 features - 0 classes - 2801627 missing values
kaggle 30day ml
0 runs0 likes0 downloads0 reach0 impact
300000 instances - 25 features - 0 classes - 0 missing values
The dataset contains information on 13,932 single-family homes sold in Miami in 2016. Besides publicly available information, the dataset creator Steven C. Bourassa has added distance variables,…
0 runs0 likes0 downloads0 reach0 impact
13932 instances - 17 features - 0 classes - 0 missing values
Detailed movie descriptions - ideal for Recommendation Engines
0 runs0 likes0 downloads0 reach0 impact
4803 instances - 11 features - classes - 514 missing values
red wine dataset
0 runs0 likes0 downloads0 reach0 impact
0 instances - 66 features - classes - 0 missing values
dfsgfhgk
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - classes - 0 missing values
sdgt
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
583 instances - 11 features - classes - 0 missing values
I am testing to upload data
0 runs0 likes0 downloads0 reach0 impact
601 instances - 7 features - classes - 0 missing values
This is a dataset about the actors and actresses who have voiced characters in Disney movies
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 8 features - 2 classes - 0 missing values
This is a dataset about the academic performance of students in different subjects
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 8 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
About This dataset is based on https://www.kaggle.com/manjeetsingh/retaildataset Historical sales data covers the period from 2010-02-05 to 2012-11-01. Sales and Features data was merged, Weekly sales…
0 runs0 likes0 downloads0 reach0 impact
149 instances - 12 features - classes - 0 missing values
Data Set Name: Autistic Spectrum Disorder Screening Data for Adult Autistic Spectrum Disorder (ASD) is a neurodevelopment condition associated with significant healthcare costs, and early diagnosis…
0 runs0 likes0 downloads0 reach0 impact
704 instances - 21 features - classes - 192 missing values
Students' Academic Performance Dataset (xAPI-Edu-Data) Data Set Characteristics: Multivariate Number of Instances: 480 Area: E-learning, Education, Predictive models, Educational Data Mining Attribute…
0 runs0 likes0 downloads0 reach0 impact
480 instances - 17 features - classes - 0 missing values
Context The aim of this dataset is to offer in a relatively small number of columns (30) data to compare the performance of some football players, or to compare the efficiency of strikers in-between…
0 runs0 likes0 downloads0 reach0 impact
1177 instances - 32 features - classes - 3867 missing values
Context Stock market forecasting and modelling has always been the problem most researched by analysts. Here we present a dataset of the Indian Stock Exchange Nifty50 Index for use for…
0 runs0 likes0 downloads0 reach0 impact
3509 instances - 9 features - classes - 0 missing values
Context Liver disease in India is increasing due to excessive consumption of alcohol and other harmful substances present in air, food items, or drugs. This dataset is created for predictive…
0 runs0 likes0 downloads0 reach0 impact
583 instances - 12 features - 0 classes - 0 missing values
Context This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one that has been used by ML…
0 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - classes - 0 missing values
Context The subject of this dataset is multi-instrument observations of solar flares. There are a number of space-based instruments that are able to observe solar flares on the Sun; some instruments…
0 runs0 likes0 downloads0 reach0 impact
12455 instances - 17 features - classes - 0 missing values
The pandemic context brings new challenges to cities. This fantastic resource was created by Google with aggregated, anonymized sets of data from users who have turned on the Location History setting…
0 runs0 likes0 downloads0 reach0 impact
1048575 instances - 14 features - classes - 5573689 missing values
Context I am currently building a short term (one-day ahead) electric load forecasting model for Goa. A good chunk of it is domestic household load. Temperature and Humidity can be used to estimate…
0 runs0 likes0 downloads0 reach0 impact
108096 instances - 9 features - classes - 0 missing values
Context Since awareness on COVID-19 began growing across the world, more health datasets have been published as open for (re-)users to utilise in creating platforms and interactive maps, for example,…
0 runs0 likes0 downloads0 reach0 impact
20976 instances - 32 features - classes - 161083 missing values
COVID-19 Dataset for Epidemic Model Development I combined several data sources to build an integrated dataset of country-level COVID-19 confirmed, recovered, and fatality cases which can be…
0 runs0 likes0 downloads0 reach0 impact
70464 instances - 11 features - classes - 1835 missing values