OpenML
test
0 runs0 likes0 downloads0 reach0 impact
97 instances - 69 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
9792 instances - 65 features - classes - 8776 missing values
The data contains information on 21263 superconductors. The first 81 columns contain extracted features and the 82nd column contains the critical temperature which is used as the target variable. The…
0 runs0 likes0 downloads0 reach0 impact
21263 instances - 82 features - 0 classes - 0 missing values
Sell by day
0 runs0 likes0 downloads0 reach0 impact
391 instances - 5 features - classes - 0 missing values
Data shows the downlink goodput for unicast and multicast transmissions with different group sizes and network loads on an IEEE 802.11ac network simulated on NS-3. Different IEEE 802.11aa GATS are…
0 runs0 likes0 downloads0 reach0 impact
4046 instances - 34 features - 0 classes - 0 missing values
33
0 runs0 likes0 downloads0 reach0 impact
8783 instances - 17 features - classes - 4314 missing values
Data shows the downlink goodput for unicast and multicast transmissions with different group sizes and network loads on an IEEE 802.11ac network simulated on NS-3. Different IEEE 802.11aa GATS are…
0 runs0 likes0 downloads0 reach0 impact
4046 instances - 34 features - 0 classes - 0 missing values
Data shows the downlink goodput for unicast and multicast transmissions with different group sizes and network loads on an IEEE 802.11ac network simulated on NS-3. Different IEEE 802.11aa GATS are…
0 runs0 likes0 downloads0 reach0 impact
4881 instances - 34 features - 0 classes - 691 missing values
Data shows the downlink goodput for unicast and multicast transmissions with different group sizes and network loads on an IEEE 802.11ac network simulated on NS-3. Different IEEE 802.11aa GATS are…
0 runs0 likes0 downloads0 reach0 impact
4322 instances - 34 features - 0 classes - 0 missing values
Data shows the downlink goodput for unicast and multicast transmissions with different group sizes and network loads on an IEEE 802.11ac network simulated on NS-3. Different IEEE 802.11aa GATS are…
0 runs0 likes0 downloads0 reach0 impact
5296 instances - 34 features - 0 classes - 1224 missing values
17StudentTimeManagement
0 runs0 likes0 downloads0 reach0 impact
125 instances - 21 features - classes - 57 missing values
18ProductivityPrediction
0 runs0 likes0 downloads0 reach0 impact
1197 instances - 15 features - 0 classes - 506 missing values
22SafetyBehaviouDuringCOVID-19
0 runs0 likes0 downloads0 reach0 impact
515 instances - 26 features - classes - 0 missing values
1
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 8 features - classes - 0 missing values
26InternationalstudentsinChina-Province
0 runs0 likes0 downloads0 reach0 impact
12 instances - 2 features - classes - 0 missing values
17OnlineShoppersPurchasingIntention
0 runs0 likes0 downloads0 reach0 impact
12330 instances - 18 features - classes - 0 missing values
18ProductivityPrediction
0 runs0 likes0 downloads0 reach0 impact
1197 instances - 15 features - classes - 506 missing values
19SemesterResult
0 runs0 likes0 downloads0 reach0 impact
178 instances - 10 features - classes - 198 missing values
24InternationalstudentsinChina-Continent
0 runs0 likes0 downloads0 reach0 impact
5 instances - 3 features - classes - 0 missing values
25InternationalstudentsinChina-Country
0 runs0 likes0 downloads0 reach0 impact
15 instances - 3 features - classes - 0 missing values
19Realestatevaluation
0 runs0 likes0 downloads0 reach0 impact
414 instances - 8 features - classes - 0 missing values
wine
0 runs0 likes0 downloads0 reach0 impact
150930 instances - 10 features - classes - 174477 missing values
admet-bbb training data
0 runs0 likes0 downloads0 reach0 impact
2054 instances - 3025 features - classes - 0 missing values
Data shows the downlink goodput for unicast and multicast transmissions with different group sizes and network loads on an IEEE 802.11ac network simulated on NS-3. Different IEEE 802.11aa GATS are…
0 runs0 likes0 downloads0 reach0 impact
3782 instances - 34 features - 0 classes - 0 missing values
#StudentPerfromance
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 8 features - 2 classes - 0 missing values
Data shows the downlink goodput for unicast and multicast transmissions with different group sizes and network loads on an IEEE 802.11ac network simulated on NS-3. Different IEEE 802.11aa GATS are…
0 runs0 likes0 downloads0 reach0 impact
2824 instances - 34 features - 0 classes - 432 missing values
wine-quality data for testing in automl-benchmark
0 runs0 likes0 downloads0 reach0 impact
1143 instances - 13 features - classes - 0 missing values
//Add the description.md of the data file Graph_Inference_Dataset Goudet, Olivier, 2017, "Graph inference datasets. Replication Data for: "Learning Functional Causal Models with Generative Neural…
0 runs0 likes0 downloads0 reach0 impact
500 instances - 101 features - classes - 0 missing values
//Add the description.md of the data file PhosphoproteinChallenge_DREAM3 Human normal and cancer hepatocytes (cell line HepG2s) were treated with 7 stimuli (Table 1a) that are relevant to hepatocyte…
0 runs0 likes0 downloads0 reach0 impact
272 instances - 21 features - classes - 0 missing values
//Add the description.md of the data file Graph_Inference_Dataset Goudet, Olivier, 2017, "Graph inference datasets. Replication Data for: "Learning Functional Causal Models with Generative Neural…
0 runs0 likes0 downloads0 reach0 impact
500 instances - 101 features - classes - 0 missing values
()[]
0 runs0 likes0 downloads0 reach0 impact
2845342 instances - 46 features - classes - 3414349 missing values
Description This is a countrywide car accident dataset, which covers 49 states of the USA. The accident data are collected from February 2016 to Dec 2020, using two APIs that provide streaming traffic…
0 runs0 likes0 downloads0 reach0 impact
2845342 instances - 46 features - classes - 3414349 missing values
Context Craigslist is the world's largest collection of used vehicles for sale, yet it's very difficult to collect all of them in the same place. I built a scraper for a school project and expanded…
0 runs0 likes0 downloads0 reach0 impact
426880 instances - 25 features - classes - 1655336 missing values
aasdasda
0 runs0 likes0 downloads0 reach0 impact
1085 instances - 1 features - classes - 0 missing values
//Add the description.md of the data file Graph_Inference_Dataset Goudet, Olivier, 2017, "Graph inference datasets. Replication Data for: "Learning Functional Causal Models with Generative Neural…
0 runs0 likes0 downloads0 reach0 impact
500 instances - 101 features - classes - 0 missing values
//Add the description.md of the data file PhosphoproteinChallenge_DREAM3 Human normal and cancer hepatocytes (cell line HepG2s) were treated with 7 stimuli (Table 1a) that are relevant to hepatocyte…
0 runs0 likes0 downloads0 reach0 impact
272 instances - 21 features - classes - 0 missing values
Content The dataset is an amalgamation of: (1) data that was scraped, which comprised a comprehensive list of movies available on various streaming platforms; (2) the IMDb dataset. Inspiration Which streaming…
0 runs0 likes0 downloads0 reach0 impact
9515 instances - 10 features - classes - 4184 missing values
Context Interested in the Indian startup ecosystem just like me? Wanted to know what type of startups are getting funded in the last few years? Wanted to know who are the important investors? Wanted to…
0 runs0 likes0 downloads0 reach0 impact
3044 instances - 10 features - classes - 4900 missing values
Context The dataset comes from one of the most important parts of a mining process: a flotation plant. The main goal is to use this data to predict how much impurity is in the ore concentrate. As this…
0 runs0 likes0 downloads0 reach0 impact
737453 instances - 24 features - classes - 0 missing values
Context Craigslist is the world's largest collection of used vehicles for sale, yet it's very difficult to collect all of them in the same place. I built a scraper for a school project and expanded…
0 runs0 likes0 downloads0 reach0 impact
426880 instances - 25 features - classes - 1655336 missing values
Hello, my name is Ben Roshan D, doing an MBA in Business Analytics at Jain University, Bangalore. We have practical sessions in Python and R as subjects. Faculty provide us with such data sets to work on…
0 runs0 likes0 downloads0 reach0 impact
215 instances - 15 features - classes - 67 missing values
Content This dataset is a record of 7 common fish species in fish market sales. With this dataset, a predictive model can be built using machine-friendly data to estimate the weight of…
0 runs0 likes0 downloads0 reach0 impact
159 instances - 7 features - 0 classes - 0 missing values
//Add the description.md of the data file pair0004 Cause-effect is a growing database with two-variable cause-effect pairs created at Max-Planck-Institute for Biological Cybernetics in Tuebingen,…
0 runs0 likes0 downloads0 reach0 impact
348 instances - 2 features - classes - 0 missing values
//Add the description.md of the data file pair0005 Cause-effect is a growing database with two-variable cause-effect pairs created at Max-Planck-Institute for Biological Cybernetics in Tuebingen,…
0 runs0 likes0 downloads0 reach0 impact
4176 instances - 2 features - classes - 0 missing values
//Add the description.md of the data file PhosphoproteinChallenge_DREAM3 Human normal and cancer hepatocytes (cell line HepG2s) were treated with 7 stimuli (Table 1a) that are relevant to hepatocyte…
0 runs0 likes0 downloads0 reach0 impact
272 instances - 21 features - classes - 0 missing values
//Add the description.md of the data file pair0001 Information for pairs0001: DWD (Deutscher Wetterdienst) data was taken at 349 stations taken from…
0 runs0 likes0 downloads0 reach0 impact
348 instances - 2 features - classes - 0 missing values
//Add the description.md of the data file pair0002 Cause-effect is a growing database with two-variable cause-effect pairs created at Max-Planck-Institute for Biological Cybernetics in Tuebingen,…
0 runs0 likes0 downloads0 reach0 impact
348 instances - 2 features - classes - 0 missing values
//Add the description.md of the data file pair0003 Cause-effect is a growing database with two-variable cause-effect pairs created at Max-Planck-Institute for Biological Cybernetics in Tuebingen,…
0 runs0 likes0 downloads0 reach0 impact
348 instances - 2 features - classes - 0 missing values
Context This dataset was created by our in-house Web Scraping and Data Mining teams at PromptCloud and DataStock. You can download the full dataset here. This sample contains 30K records. Content This…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 68 features - classes - 129439 missing values
Overview This dataset contains 3 million Sudoku puzzles and their solutions. The level of difficulty varies -- some can be solved easily by a beginner, while others will challenge experienced solvers.…
0 runs0 likes0 downloads0 reach0 impact
3000000 instances - 4 features - 0 classes - 0 missing values
Context Will you be able to identify genuine and counterfeit banknotes, even if half of the data is counterfeit? Perfect for testing different outlier detection algorithms. Content The dataset includes…
0 runs0 likes0 downloads0 reach0 impact
200 instances - 7 features - 0 classes - 0 missing values
Context Climate change is forcing cities to re-imagine their transportation infrastructure. Shared mobility concepts, such as car sharing, bike sharing or scooter sharing, are becoming more and more popular.…
0 runs0 likes0 downloads0 reach0 impact
2922 instances - 29 features - classes - 51510 missing values
Context You can find detailed weather data (2015-2020) for Ulaanbaatar, the capital city of Mongolia. Content The data includes the timestamps (UTC) and time-based data of weather-related features,…
0 runs0 likes0 downloads0 reach0 impact
49184 instances - 20 features - classes - 138 missing values
Context Information about more than 4600 companies tradable on the Robinhood website. Source The information is scraped from the Robinhood website. Inspiration The dataset contains useful information about…
0 runs0 likes0 downloads0 reach0 impact
4610 instances - 33 features - classes - 8696 missing values
This data was extracted from the 1994 Census bureau database by Ronny Kohavi and Barry Becker (Data Mining and Visualization, Silicon Graphics). A set of reasonably clean records was extracted using…
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 15 features - 2 classes - 0 missing values
What is it? This dataset is a record of 3.5 million+ US domestic flights from 1990 to 2009. It has been taken from the OpenFlights website, which has a huge database of different travelling mediums…
0 runs0 likes0 downloads0 reach0 impact
3606803 instances - 15 features - classes - 27522 missing values
This data set contains demographic, geographic, and severity information for all confirmed and probable cases reported to and managed by Toronto Public Health since the first case was reported in…
0 runs0 likes0 downloads0 reach0 impact
14911 instances - 17 features - classes - 1212 missing values
Context The dataset was collated as a casual project using data from Genius and Big Hit. Currently contains 18 albums (check section "Albums in dataset" below for more details). Columns id (int) :…
0 runs0 likes0 downloads0 reach0 impact
231 instances - 16 features - classes - 439 missing values
Context The unprocessed dataset was acquired from UCI Machine Learning organisation. This dataset is preprocessed by me, originally from the National Institute of Diabetes and Digestive and Kidney…
0 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - 0 classes - 0 missing values
Context Bus Breakdown and Delays You can find the road where the traffic was heavy for the New York City Taxi Trip Duration playground. Content The Bus Breakdown and Delay system collects information…
0 runs0 likes0 downloads0 reach0 impact
147972 instances - 20 features - classes - 170487 missing values
Context To build an AI to predict the stock price of the Dollar currency on IBOVESPA I had to make this dataset. All information collected here is from a standard graphic for stock prices. Content The…
0 runs0 likes0 downloads0 reach0 impact
33077 instances - 18 features - classes - 0 missing values
Context Data set containing results from 2019 Ironman World Championship in Kona, Hawaii to help answer the question of which countries produce the fastest overall finish times, as well as fastest…
0 runs0 likes0 downloads0 reach0 impact
2434 instances - 14 features - classes - 766 missing values
Context As you all know, as per the observations of economists and according to the current trend, it seems that the yellow metal is performing better as an investment option in comparison to mutual…
0 runs0 likes0 downloads0 reach0 impact
4971 instances - 6 features - classes - 0 missing values
Context This dataset deals with pollution in the U.S. Pollution in the U.S. has been well documented by the U.S. EPA but it is a pain to download all the data and arrange them in a format that…
0 runs0 likes0 downloads0 reach0 impact
1746661 instances - 29 features - classes - 1746230 missing values
Context Manufacturing process feature selection and categorization Content Abstract: Data from a semi-conductor manufacturing process Data Set Characteristics: Multivariate Number of Instances: 1567…
0 runs0 likes0 downloads0 reach0 impact
1567 instances - 592 features - classes - 41951 missing values
Introduction The idea behind this dataset is to see how the number of people and the home size affect the monthly electricity consumption in the household. Column description: Column Explanation…
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 10 features - classes - 0 missing values
Context Though it doesn't rain often in Los Angeles, the city has various means of capturing rainfall to increase our local water supply. This dataset shows how much water we've been capturing cumulatively…
0 runs0 likes0 downloads0 reach0 impact
296 instances - 7 features - classes - 0 missing values
Context This data set was created to help Kaggle users in the New York City Taxi Trip Duration competition. New features were generated using the Wolfram Mathematica system. Hope that this data set will…
0 runs0 likes0 downloads0 reach0 impact
2083778 instances - 24 features - classes - 6810 missing values
Clash Royale Ladder Battles dataset I expect to update this dataset daily I encourage you to make an EDA or even train a model to predict if I win or lose! Notes: "op" means opponent my/optroops,…
0 runs0 likes0 downloads0 reach0 impact
827 instances - 147 features - classes - 0 missing values
Overwatch is a team-based multiplayer first-person shooter video game developed and published by Blizzard Entertainment, which was released on May 24, 2016 for PlayStation 4, Xbox One, and Windows.…
0 runs0 likes0 downloads0 reach0 impact
3299 instances - 47 features - classes - 49733 missing values
This is historical data of the HangSeng Futures Index based in Hong Kong. For non-traders, the data is a time series (sequential flow of numbers) describing the HangSeng Futures Index of Hong Kong. Every…
0 runs0 likes0 downloads0 reach0 impact
87645 instances - 8 features - classes - 0 missing values
This dataset reflects incidents of crime in the City of Los Angeles dating back to 2010. This data is transcribed from original crime reports that are typed on paper and therefore there may be some…
0 runs0 likes0 downloads0 reach0 impact
1692056 instances - 26 features - classes - 9275842 missing values
Dataset includes date and time, opponent, score, rushing statistics, passing statistics, turnovers, Nebraska penalties, point spread, and weather information. All games are included. Penalty data is…
0 runs0 likes0 downloads0 reach0 impact
733 instances - 31 features - classes - 1009 missing values
Citation Request: This dataset is publicly available for research. The details are described in [Cortez et al., 2009]. Please include this citation if you plan to use this database: P. Cortez, A.…
0 runs0 likes0 downloads0 reach0 impact
6497 instances - 14 features - classes - 0 missing values
Context This data is the result of using neural networks and reinforcement learning to simulate the board game "Machi Koro". Here is the source code for the AI and simulation:…
0 runs0 likes0 downloads0 reach0 impact
614584 instances - 86 features - classes - 0 missing values
The datasets cover the stock market in two industries; each industry has ten different companies, with six columns and 600 rows, on the same dates for all 20 companies, for the period from 2020-02-06…
0 runs0 likes0 downloads0 reach0 impact
600 instances - 7 features - classes - 0 missing values
Context Fake news has become one of the biggest problems of our age. It has serious impact on our online as well as offline discourse. One can even go as far as saying that, to date, fake news poses a…
0 runs0 likes0 downloads0 reach0 impact
6335 instances - 4 features - classes - 0 missing values
The following dataset contains data on blog posts from MarginalRevolution.com. For posts from Jan. 1, 2010 to 9/17/2016, the following attributes are gathered. Author Name Post Title Post Date Post…
0 runs0 likes0 downloads0 reach0 impact
12820 instances - 22 features - classes - 81 missing values
Context The dataset contains reviews from the Google Play Store on Snapchat. With the sentiment analysis, we can check for the users' adoption of the Android version of Snapchat, which has been improved…
0 runs0 likes0 downloads0 reach0 impact
32875 instances - 4 features - classes - 0 missing values
Context Soy beans are a major agricultural crop. Content Compilation of soybean prices and factors that affect soybean prices. Daily data. Temp columns are daily temperatures of the major U.S. grow…
0 runs0 likes0 downloads0 reach0 impact
21828 instances - 48 features - classes - 0 missing values
Context Many kids' products are thought to contain dangerous metals. Due to this it is important to make sure that children can enjoy their toys while staying as safe as possible. Content In this…
0 runs0 likes0 downloads0 reach0 impact
1635 instances - 10 features - classes - 0 missing values
Title: 1985 Auto Imports Database Source Information: -- Creator/Donor: Jeffrey C. Schlimmer (Jeffrey.Schlimmer@a.gp.cs.cmu.edu) -- Date: 19 May 1987 -- Sources: 1) 1985 Model Import Car and Truck…
0 runs0 likes0 downloads0 reach0 impact
205 instances - 26 features - classes - 59 missing values
Context This dataset consists of tweets regarding machine learning, posted by Prof. Andrew Ng, co-founder of Coursera and an Adjunct Professor of Computer Science at Stanford University. His machine…
0 runs0 likes0 downloads0 reach0 impact
1234 instances - 8 features - classes - 1 missing values
These data are the results of a chemical analysis of wines grown in the same region in Italy but derived from three different cultivars. The analysis determined the quantities of 13 constituents found…
0 runs0 likes0 downloads0 reach0 impact
178 instances - 14 features - classes - 0 missing values
Context There's a story behind every dataset and here's your opportunity to share yours. Imdb-data is a dataset of various movies gathered together. Content What's inside is more than just rows and…
0 runs0 likes0 downloads0 reach0 impact
1003 instances - 12 features - classes - 196 missing values
Context Inspired by the New York City Taxi Trip Duration playground I created a dataset using the publicly available data from this link. Citi Bike is a bike sharing service available in New York…
0 runs0 likes0 downloads0 reach0 impact
4500000 instances - 8 features - classes - 0 missing values
Google Play Store Google Play, formerly Android Market, is a digital distribution service operated and developed by Google. It serves as the official app store for certified devices running on the…
0 runs0 likes0 downloads0 reach0 impact
12495 instances - 12 features - classes - 15602 missing values
Context The International Chess Federation (FIDE) governs international chess competition. FIDE uses the Elo rating system for calculating the relative skill levels of players. Content The dataset…
0 runs0 likes0 downloads0 reach0 impact
977687 instances - 10 features - classes - 3994218 missing values
Context This data set contains the values of Amazon stock between 15 July 2019 and 15 July 2020. Content The data set contains 7 different columns that include Date, the opening value…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 8 features - classes - 23 missing values
Source: UCI Machine Learning Repository Content A dataset of manually-curated BCF for 779 chemicals was used to determine the mechanisms of bioconcentration, i.e. to predict whether a chemical: (1) is…
0 runs0 likes0 downloads0 reach0 impact
779 instances - 14 features - classes - 0 missing values
DESCRIPTION Problem Statement NIDDK (National Institute of Diabetes and Digestive and Kidney Diseases) research creates knowledge about and treatments for the most chronic, costly, and consequential…
0 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - 0 classes - 0 missing values
A modified version of the European Soccer Database by Hugo Mathien (https://www.kaggle.com/hugomathien). New players' performance indicators have been created from the original dataset. Data features:…
0 runs0 likes0 downloads0 reach0 impact
25979 instances - 31 features - classes - 35756 missing values
Context Electoral integrity refers to international standards and global norms governing the appropriate conduct of elections. These standards have been endorsed in a series of authoritative…
0 runs0 likes0 downloads0 reach0 impact
726 instances - 85 features - classes - 4001 missing values
Coronavirus Country Profiles We built 207 country profiles which allow you to explore the statistics on the coronavirus pandemic for every country in the world. In a fast-evolving pandemic it is not a…
0 runs0 likes0 downloads0 reach0 impact
170646 instances - 66 features - classes - 5082293 missing values
Context A compilation of all the development-related courses (10 thousand courses) which are available on Udemy's website. Under the development category, there are courses from Web Development, Data…
0 runs0 likes0 downloads0 reach0 impact
9932 instances - 17 features - classes - 336 missing values
Context These data come from a camera that is part of the Telraam device which makes counting cameras available to interested citizens. https://www.telraam.net/fr/what-is-telraam This camera is…
0 runs0 likes0 downloads0 reach0 impact
970 instances - 24 features - classes - 2928 missing values
Context This dataset was created by our in-house teams at PromptCloud (https://www.promptcloud.com/) and DataStock (https://datastock.shop/). We have about 5K samples in this dataset. You can download…
0 runs0 likes0 downloads0 reach0 impact
741 instances - 8 features - classes - 1575 missing values
Context https://www.kaggle.com/c/birdsong-recognition Content This is information captured about bird recordings from India. Acknowledgements We wouldn't be here without the help of many contributors…
0 runs0 likes0 downloads0 reach0 impact
13654 instances - 26 features - classes - 26863 missing values
Context This dataset is generated after an exploratory analysis of data provided by Carbon Disclosure Project in the following competition: CDP: Unlocking Climate Solutions. It contains a compilation…
0 runs0 likes0 downloads0 reach0 impact
615 instances - 58 features - classes - 9612 missing values
About Dataset This data set includes 15K FIFA 20 players with 15+ features and their images, including their position, age, and country, and many more. It can be used for learning statistics,…
0 runs0 likes0 downloads0 reach0 impact
14999 instances - 74 features - classes - 14009 missing values