OpenML
Pulsar candidates collected during the HTRU survey. Pulsars are a type of star of considerable scientific interest. Candidates must be classified into pulsar and non-pulsar classes to aid discovery.…
0 runs0 likes0 downloads0 reach0 impact
17898 instances - 9 features - 2 classes - 0 missing values
Description: Pulsars are a rare type of neutron star that produces radio emission detectable here on Earth. They are of considerable scientific interest as probes of space-time, the interstellar…
0 runs0 likes0 downloads0 reach0 impact
17897 instances - 9 features - classes - 0 missing values
Context: The dataset is from a rental price prediction project I did. It includes different types of properties (apartments, independent floors, independent houses, villas, etc.). It contains 12000 rental…
0 runs0 likes0 downloads0 reach0 impact
17890 instances - 15 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
17496 instances - 10 features - 0 classes - 0 missing values
Electric power distribution, hourly data. From original source: ----- The electric power distribution problem is the distribution of electricity to different areas depending on its sequential usage. But…
0 runs0 likes0 downloads0 reach0 impact
17420 instances - 10 features - classes - 0 missing values
Electric power distribution, hourly data. From original source: ----- The electric power distribution problem is the distribution of electricity to different areas depending on its sequential usage. But…
0 runs0 likes0 downloads0 reach0 impact
17420 instances - 10 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on categorical and numerical features"…
0 runs0 likes0 downloads0 reach0 impact
17379 instances - 12 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on categorical and numerical features"…
0 runs0 likes0 downloads0 reach0 impact
17379 instances - 12 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
17379 instances - 7 features - 0 classes - 0 missing values
Bike sharing systems are a new generation of traditional bike rentals where the whole process, from membership to rental and return, has become automatic. Through these systems, a user is able to easily rent…
0 runs0 likes0 downloads0 reach0 impact
17379 instances - 13 features - 0 classes - 0 missing values
Bike sharing systems are a new generation of traditional bike rentals where the whole process, from membership to rental and return, has become automatic. Through these systems, a user is able to easily rent…
0 runs0 likes0 downloads0 reach0 impact
17379 instances - 13 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
17379 instances - 7 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
17379 instances - 7 features - 0 classes - 0 missing values
Context: Candy Crush Saga is a hit mobile game developed by King (part of Activision Blizzard) that is played by millions of people all around the world. The game is structured as a series of levels…
0 runs0 likes0 downloads0 reach0 impact
16865 instances - 5 features - 0 classes - 0 missing values
Problem Statement: The following Data Analysis of Marketing Campaigns is part of the assignment for Data Science Intern at Merkle Sokrati. Objectives of the Task: Carry out EDA and build an ML model to…
0 runs0 likes0 downloads0 reach0 impact
16834 instances - 16 features - classes - 546 missing values
Historical Rainfall data of Bangladesh
0 runs0 likes0 downloads0 reach0 impact
16755 instances - 4 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
16714 instances - 11 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
16714 instances - 11 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
16714 instances - 11 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
16714 instances - 11 features - 2 classes - 0 missing values
Has the curve flattened? Countries around the world are working to flatten the curve of the coronavirus pandemic. Flattening the curve involves reducing the number of new COVID-19 cases from one day…
0 runs0 likes0 downloads0 reach0 impact
16659 instances - 28 features - classes - 24650 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on categorical and numerical features"…
0 runs0 likes0 downloads0 reach0 impact
16644 instances - 18 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on categorical and numerical features"…
0 runs0 likes0 downloads0 reach0 impact
16644 instances - 18 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on categorical and numerical features"…
0 runs0 likes0 downloads0 reach0 impact
16644 instances - 18 features - 2 classes - 0 missing values
This data set is also obtained from the task of controlling an F16 aircraft, although the target variable and attributes are different from the ailerons domain. In this case the goal variable is…
5 runs0 likes0 downloads0 reach0 impact
16599 instances - 19 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1176 runs0 likes0 downloads0 reach0 impact
16599 instances - 19 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
16599 instances - 17 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
16599 instances - 18 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
16599 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
16599 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
16599 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
16599 instances - 17 features - 0 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
16598 instances - 11 features - classes - 329 missing values
CD4 count prediction date
0 runs0 likes0 downloads0 reach0 impact
16484 instances - 62 features - classes - 0 missing values
nfl_games
0 runs0 likes0 downloads0 reach0 impact
16274 instances - 12 features - classes - 0 missing values
## **Meta-Album Cars Dataset (Extended)** The original Cars dataset (https://ai.stanford.edu/~jkrause/cars/car_dataset.html) was collected in 2013, and it contains more than 16 000 images from 196…
0 runs0 likes0 downloads0 reach1 impact
16185 instances - 3 features - 196 classes - 16185 missing values
No data.
2 runs0 likes0 downloads0 reach0 impact
16000 instances - 27649 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 27649 features - 2 classes - 0 missing values
A subset of the 3D dataset from Princeton's COS 429 Computer Vision course. The dataset consists of 40 models organised into 4 classes of 10 objects each.
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 4 features - classes - 0 missing values
Remote sensing dataset (stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
Remote sensing dataset (stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
100 Sports Dataset (stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
Textures dataset from Describable Textures Dataset (stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
Airplanes dataset with different airplane models (stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
Mammals dataset for image classification (stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
Birds dataset for image classification (stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
Dogs dataset with different breeds of dogs (stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
Plant disease dataset (stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
Healthy Medicinal Leaf (stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
Plants Dataset with different species of plants (stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
Insects dataset for Insect Pest Recognition (Stylized)
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 7 features - 20 classes - 0 missing values
This dataset includes 15856 records of individuals who are or were subscribers to this newspaper. The dataset contains demographic information such as HH Income, which stands for household income, home…
0 runs0 likes0 downloads0 reach0 impact
15855 instances - 19 features - 2 classes - 1430 missing values
Indoor scene recognition is a challenging open problem in high level vision. Most scene recognition models that work well for outdoor scenes perform poorly in the indoor domain. The main difficulty is…
0 runs0 likes0 downloads0 reach0 impact
15620 instances - 3 features - 67 classes - 0 missing values
Indoor scene recognition is a challenging open problem in high level vision. Most scene recognition models that work well for outdoor scenes perform poorly in the indoor domain. The main difficulty is…
1 runs0 likes0 downloads0 reach0 impact
15620 instances - 3 features - 67 classes - 0 missing values
Test dataset
0 runs0 likes0 downloads0 reach0 impact
15547 instances - 61 features - 0 classes - 280 missing values
Test dataset
3 runs0 likes0 downloads0 reach0 impact
15547 instances - 61 features - 2 classes - 280 missing values
Test dataset
0 runs0 likes0 downloads0 reach0 impact
15547 instances - 61 features - 0 classes - 280 missing values
Test dataset
0 runs0 likes0 downloads0 reach0 impact
15547 instances - 61 features - 0 classes - 280 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
109967 runs0 likes0 downloads0 reach0 impact
15545 instances - 6 features - 2 classes - 0 missing values
## **Meta-Album Fungi Dataset (Extended)** The Meta-Album Fungi dataset is created by sampling the Danish Fungi 2020 dataset (https://arxiv.org/abs/2103.10107), itself a sampling of the Atlas of Danish…
0 runs0 likes0 downloads0 reach1 impact
15122 instances - 3 features - 25 classes - 0 missing values
## **Meta-Album Subcellular Human Protein Dataset (Extended)** This dataset is a subset of the Subcellular dataset in the Protein Atlas project (https://www.proteinatlas.org/). The original dataset,…
0 runs0 likes0 downloads0 reach1 impact
15050 instances - 3 features - 21 classes - 15050 missing values
San Francisco International Airport Report on Monthly Passenger Traffic Statistics by Airline. Airport data is seasonal in nature, therefore any comparative analyses should be done on a…
0 runs0 likes0 downloads0 reach0 impact
15007 instances - 16 features - classes - 108 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
624 runs0 likes0 downloads0 reach0 impact
15000 instances - 49 features - 2 classes - 0 missing values
This is a commercial application described in Weiss & Indurkhya (1995). The data describes a telecommunication problem. No further information is available. Characteristics: (10000+5000) cases, 49…
5 runs0 likes0 downloads0 reach0 impact
15000 instances - 49 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
15000 instances - 27 features - 0 classes - 0 missing values
exercises
0 runs0 likes0 downloads0 reach0 impact
15000 instances - 8 features - classes - 0 missing values
exercises
0 runs0 likes0 downloads0 reach0 impact
15000 instances - 8 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
15000 instances - 27 features - 0 classes - 0 missing values
Title: Flora. Source: https://automl.chalearn.org/data. Dataset from the first ChaLearn AutoML challenge (2014). Only the training data is included, as there were no labels for validation and…
0 runs0 likes0 downloads0 reach0 impact
15000 instances - 200001 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
15000 instances - 27 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
15000 instances - 27 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
15000 instances - 27 features - 0 classes - 0 missing values
About Dataset: This data set includes 15K Fifa20 players with 15+ features and their images, including their position, age, country, and many more. It can be used for learning Statistics,…
0 runs0 likes0 downloads0 reach0 impact
14999 instances - 74 features - classes - 14009 missing values
All data is from one continuous EEG measurement with the Emotiv EEG Neuroheadset. The duration of the measurement was 117 seconds. The eye state was detected via a camera during the EEG measurement…
166511 runs0 likes0 downloads0 reach0 impact
14980 instances - 15 features - 2 classes - 0 missing values
This data set contains demographic, geographic, and severity information for all confirmed and probable cases reported to and managed by Toronto Public Health since the first case was reported in…
0 runs0 likes0 downloads0 reach0 impact
14911 instances - 17 features - classes - 1212 missing values
Context: This is a complete set of player data from the eFootball PES 2021 video game, the latest of the football franchise by KONAMI. It can be useful for generating insights about trends of skills and…
0 runs0 likes0 downloads0 reach0 impact
14693 instances - 120 features - classes - 6844 missing values
Context There's a story behind every dataset and here's your opportunity to share yours. Content What's inside is more than just rows and columns. Make it easy for others to get started by describing…
0 runs0 likes0 downloads0 reach0 impact
14640 instances - 15 features - classes - 61973 missing values
Context As a fan of anime (Japanese animated media), I have always wanted to have information about all the anime I can get my hands on. Content This data was scraped from anime-planet on June 15,…
0 runs0 likes0 downloads0 reach0 impact
14578 instances - 18 features - classes - 28902 missing values
Finding which features are most important to job profitability
0 runs0 likes0 downloads0 reach0 impact
14480 instances - 30 features - 0 classes - 0 missing values
3D Location Estimation using RSSI of WLAN dataset. The 3D Location Estimation Using RSSI of Wireless LAN challenge aims to develop an AI/ML-based localization algorithm that can accurately estimate the…
0 runs0 likes0 downloads0 reach0 impact
14400 instances - 16 features - 0 classes - 0 missing values
3D Location Estimation using RSSI of WLAN dataset. The 3D Location Estimation Using RSSI of Wireless LAN challenge aims to develop an AI/ML-based localization algorithm that can accurately estimate the…
0 runs0 likes0 downloads0 reach0 impact
14400 instances - 16 features - classes - 0 missing values
__Major changes w.r.t. version 1: changed binary features to data type factor.__ Dataset from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch), which consisted of…
0 runs0 likes0 downloads0 reach0 impact
14395 instances - 217 features - classes - 0 missing values
Datasets from the Agnostic Learning vs. Prior Knowledge Challenge (http://www.agnostic.inf.ethz.ch) Dataset from: http://www.agnostic.inf.ethz.ch/datasets.php Modified by TunedIT (converted to ARFF…
486 runs0 likes0 downloads0 reach0 impact
14395 instances - 109 features - 2 classes - 0 missing values
For testing algorithms
74 runs0 likes0 downloads0 reach0 impact
14240 instances - 31 features - 2 classes - 0 missing values
Author: Marius Lindauer. Date: 27.02.2014. This data set was generated for a publication about claspfolio 2.0, i.e., an algorithm selector for ASP. The algorithm portfolio of clasp (2.1.4)…
0 runs0 likes0 downloads0 reach0 impact
14234 instances - 143 features - 0 classes - 200838 missing values
Remote sensing dataset
0 runs0 likes0 downloads0 reach0 impact
14000 instances - 2 features - 20 classes - 0 missing values
The dataset contains information on 13,932 single-family homes sold in Miami in 2016. Besides publicly available information, the dataset creator Steven C. Bourassa has added distance variables,…
0 runs0 likes0 downloads0 reach0 impact
13932 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
13932 instances - 14 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
13932 instances - 14 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
13932 instances - 14 features - 0 classes - 0 missing values
The dataset contains information on 13,932 single-family homes sold in Miami. Content: The goal is to predict the sale price. The dataset contains the following columns: * PARCELNO: unique identifier…
0 runs0 likes0 downloads0 reach0 impact
13932 instances - 16 features - 0 classes - 0 missing values
A Vergara, S Vembu, T Ayhan, M Ryan, M Homer, R Huerta. "Chemical gas sensor drift compensation using classifier ensembles." Sensors and Actuators B: Chemical 166 (2012): 320-329. I Rodriguez-Lujan, J…
68 runs0 likes0 downloads0 reach0 impact
13910 instances - 130 features - 6 classes - 0 missing values
### Description Gas Sensor Array Drift Dataset Data Set ### Sources ``` (a) Creators: Alexander Vergara (vergara '@' ucsd.edu) BioCircuits Institute University of California San Diego San Diego,…
18522 runs0 likes0 downloads0 reach0 impact
13910 instances - 129 features - 6 classes - 0 missing values
Context Meat consumption is related to living standards, diet, livestock production and consumer prices, as well as macroeconomic uncertainty and shocks to GDP. Compared to other commodities, meat is…
0 runs0 likes0 downloads0 reach0 impact
13760 instances - 5 features - classes - 0 missing values
This data set addresses a control problem, namely flying an F16 aircraft. The attributes describe the status of the aeroplane, while the goal is to predict the control action on the ailerons of the…
0 runs0 likes0 downloads0 reach0 impact
13750 instances - 41 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
602 runs0 likes0 downloads0 reach0 impact
13750 instances - 41 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
13750 instances - 34 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
13750 instances - 34 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
13750 instances - 34 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
13750 instances - 34 features - 0 classes - 0 missing values