OpenML
No data.
0 runs0 likes0 downloads0 reach0 impact
37 instances - 19 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
8378 instances - 123 features - classes - 18372 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
841 instances - 74 features - classes - 0 missing values
tesl dataset about L
0 runs0 likes0 downloads0 reach0 impact
150000 instances - 8 features - classes - 0 missing values
#test data for mlp
0 runs0 likes0 downloads0 reach0 impact
200 instances - 12 features - classes - 0 missing values
PM 2.5 dataset
0 runs0 likes0 downloads0 reach0 impact
43800 instances - 10 features - classes - 0 missing values
Survey to know if people self-identify as Midwesterners.
0 runs0 likes0 downloads0 reach0 impact
2778 instances - 28 features - 10 classes - 1737 missing values
Survey to know if people self-identify as Midwesterners.
0 runs0 likes0 downloads0 reach0 impact
2494 instances - 28 features - 9 classes - 99 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
841 instances - 74 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
5472 instances - 15 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
252 instances - 14 features - classes - 0 missing values
Context The Philippine Statistics Authority (PSA) spearheads the conduct of the Family Income and Expenditure Survey (FIES) nationwide. The survey, which is undertaken every three (3) years, is aimed…
0 runs0 likes0 downloads0 reach0 impact
41544 instances - 60 features - classes - 15072 missing values
Context Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative…
0 runs0 likes0 downloads0 reach0 impact
1080 instances - 81 features - classes - 1396 missing values
Context of dataset There has been a rise in the demand for online delivery in metropolitan cities such as Bangalore in India. The question about why this increase in the demand has always been a…
0 runs0 likes0 downloads0 reach0 impact
388 instances - 55 features - classes - 0 missing values
Mexico COVID-19 clinical data This dataset contains the results of real-time PCR testing for COVID-19 in Mexico as reported by the [General Directorate of…
0 runs0 likes0 downloads0 reach0 impact
263007 instances - 41 features - classes - 6 missing values
Context This dataset contains 100k dielectron events in the invariant mass range 2-110 GeV for use in outreach and education. These data were selected for use in education and outreach and contain a…
0 runs0 likes0 downloads0 reach0 impact
Context CS:GO is a tactical shooter, where two teams (CT and Terrorist) play for a best of 30 rounds, with each round being 1 minute and 55 seconds. There are 5 players on each team (10 in total) and…
0 runs0 likes0 downloads0 reach0 impact
122410 instances - 97 features - classes - 0 missing values
Introduction The dataset contains all the entire apartments located in Milan (N = 9322). This public dataset is part of Airbnb, and the original source can be found on this website. Dataset Creation…
0 runs0 likes0 downloads0 reach0 impact
9322 instances - 61 features - classes - 0 missing values
Context I am currently writing a seminar paper about content summarization in Twitter networks. One of the Frameworks I read about uses a peak detection algorithm to cluster tweets by topic-aware peak…
0 runs0 likes0 downloads0 reach0 impact
30000 instances - 52 features - classes - 0 missing values
This dataset is based on dipam7/student-grade-prediction Dataset General information This data approach student achievement in secondary education of two Portuguese schools. The data attributes…
0 runs0 likes0 downloads0 reach0 impact
395 instances - 30 features - 0 classes - 0 missing values
Voice Gender Gender Recognition by Voice and Speech Analysis This database was created to identify a voice as male or female, based upon acoustic properties of the voice and speech. The dataset…
0 runs0 likes0 downloads0 reach0 impact
3168 instances - 21 features - classes - 0 missing values
Context The official Goodreads API limits retrievable data, so I decided to scrape the actual HTTP pages and grab additional details on each book. Content Books are scraped from a list titles the…
0 runs0 likes0 downloads0 reach0 impact
52199 instances - 31 features - classes - 285951 missing values
Context A person makes a doctor's appointment, receives all the instructions, and does not show up. Who is to blame? If this helps you in studying or working, please don't forget to upvote :). Reference to Joni Hoppen…
0 runs0 likes0 downloads0 reach0 impact
110527 instances - 13 features - 2 classes - 0 missing values
Context This is the Alcohol Consumption in Russia (1998-2016) dataset. It contains values of consumption for wine, beer, vodka, brandy and champagne. Content The dataset has 1615 rows and 7 columns. Keys for…
0 runs0 likes0 downloads0 reach0 impact
1615 instances - 7 features - classes - 311 missing values
Context This is historical data on cryptocurrency trading for the period from 2016-01-01 to 2021-02-21. If you enjoy this dataset please upvote so I can see it is popular and I need to update it.…
0 runs1 likes0 downloads1 reach0 impact
2382643 instances - 17 features - classes - 4862194 missing values
Introduction TMDB.org is a crowd-sourced movie information database used by many film-related consoles, sites and apps, such as XBMC, MythTV and Plex. Dozens of media managers, mobile apps and social…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 6 features - classes - 30 missing values
Context Since Amazon Echo Dot 2 has been the best selling Alexa product, we decided to extract the reviews posted on Amazon for this device. This particular dataset contains reviews posted in…
0 runs0 likes0 downloads0 reach0 impact
6855 instances - 10 features - 0 classes - 13903 missing values
Context The objective of this dataset is to create a chess engine through machine learning. In this first part we will first predict the pieces to be moved depending on the position of the chessboard…
0 runs0 likes0 downloads0 reach0 impact
2632753 instances - 66 features - classes - 0 missing values
This dataset contains data about the physical and chemical properties of the Li-ion silicate cathodes. These properties can be useful to predict the class of a Li-ion battery. These batteries can be…
0 runs0 likes0 downloads0 reach0 impact
339 instances - 11 features - classes - 0 missing values
Why 2017 to 2020? The Bitcoin data is limited to these 3 years because, as everyone knows, Dec 2017 was the time when Bitcoin prices skyrocketed! Hence, this duration is perfect for predicting future…
0 runs0 likes0 downloads0 reach0 impact
1097 instances - 7 features - classes - 0 missing values
Dataset Information This dataset contains information on default payments, demographic factors, credit data, history of payment, and bill statements of credit card clients in Taiwan from April 2005 to…
0 runs0 likes0 downloads0 reach0 impact
30000 instances - 24 features - classes - 0 missing values
This data was extracted from the 1994 Census bureau database by Ronny Kohavi and Barry Becker (Data Mining and Visualization, Silicon Graphics). A set of reasonably clean records was extracted using…
0 runs0 likes1 downloads1 reach0 impact
32561 instances - 15 features - classes - 4262 missing values
Context Melbourne real estate is BOOMING. Can you find the insight or predict the next big trend to become a real estate mogul or even harder, to snap up a reasonably priced 2-bedroom unit? Content…
0 runs0 likes0 downloads0 reach0 impact
13580 instances - 21 features - classes - 13256 missing values
Context This data set aims to provide a starting point for those who have just started off with deep learning and to act as a benchmark. Content The feature set includes: Cement, Blast Furnace Slag, Fly Ash, Water…
0 runs0 likes0 downloads0 reach0 impact
1030 instances - 9 features - classes - 0 missing values
Context We collect recent tweets of Donald Trump, the 45th President of the United States of America. The data is collected using the tweepy Python package to access the Twitter API. Inspiration Study the…
0 runs0 likes0 downloads0 reach0 impact
1984 instances - 16 features - classes - 1914 missing values
Content The dataset contains 517 fires from the Montesinho natural park in Portugal. For each incident, the weekday, month, coordinates, and burnt area are recorded, as well as several meteorological…
0 runs0 likes0 downloads0 reach0 impact
517 instances - 13 features - 0 classes - 0 missing values
Context The Office is an American Mockumentary sitcom television series that depicts the everyday lives of office employees in the Scranton, Pennsylvania, branch of the fictional Dunder Mifflin Paper…
0 runs0 likes0 downloads0 reach0 impact
188 instances - 12 features - 0 classes - 159 missing values
Context Imbalanced classes put accuracy out of business. This is a surprisingly common problem in machine learning (specifically in classification), occurring in datasets with a disproportionate ratio…
0 runs0 likes0 downloads0 reach0 impact
1723 instances - 14 features - 0 classes - 0 missing values
This case requires developing a customer segmentation to define a marketing strategy. The sample dataset summarizes the usage behavior of about 9000 active credit card holders during the last 6 months.…
0 runs0 likes0 downloads0 reach0 impact
8950 instances - 17 features - classes - 314 missing values
This is the car sales data set, which includes information about different cars. This data set is taken from Analytixlabs for the purpose of prediction. In this we have to see two things. First…
0 runs0 likes0 downloads0 reach0 impact
157 instances - 16 features - classes - 51 missing values
Business Problem It is often said that great wine is made in the vineyard, not in the winery. However, winemakers have the ability to modify certain aspects of the wines they produce, such as the…
0 runs0 likes0 downloads0 reach0 impact
6497 instances - 13 features - classes - 0 missing values
Content The S&P BSE SENSEX (S&P Bombay Stock Exchange Sensitive Index), also called the BSE 30 or simply the SENSEX, is a free-float market-weighted stock market index of 30 well-established and…
0 runs0 likes0 downloads0 reach0 impact
73316 instances - 8 features - classes - 1500 missing values
Context As a community of responsible people, can we focus on predicting these disasters to save lives? People are already going through worse, as if that was not enough, if a sudden earthquake comes…
0 runs0 likes0 downloads0 reach0 impact
13729 instances - 22 features - classes - 25869 missing values
Data Set Information: This has been collected using direct questionnaires from the patients of Sylhet Diabetes Hospital in Sylhet, Bangladesh and approved by a doctor.…
0 runs0 likes0 downloads0 reach0 impact
520 instances - 17 features - classes - 0 missing values
Context Cricket based datasets are not readily available for analysis on most of the portals. Hence, an effort to provide test match data for 2200+ tests after cleaning it for very special cases.…
0 runs0 likes0 downloads0 reach0 impact
8291 instances - 14 features - classes - 8291 missing values
Context There's a story behind every dataset and here's your opportunity to share yours. Content What's inside is more than just rows and columns. Make it easy for others to get started by describing…
0 runs0 likes0 downloads0 reach0 impact
14640 instances - 15 features - classes - 61973 missing values
data source https://br.financas.yahoo.com Content Petróleo Brasileiro S.A. - Petrobras (PETR4.SA), from May 19, 2014 to May 16, 2019 Sao Paulo - Sao Paulo Price Postponed. Currency in BRL.…
0 runs0 likes0 downloads0 reach0 impact
1243 instances - 7 features - classes - 0 missing values
Context Real Estate inventory of listings from 2012-2017 Content Includes data for all Real Estate listings in the US, such as, active listings, prices, days on market, price changes, and pending…
0 runs0 likes0 downloads0 reach0 impact
974066 instances - 34 features - classes - 5792129 missing values
Context There's a story behind every dataset and here's your opportunity to share yours. Content What's inside is more than just rows and columns. Make it easy for others to get started by describing…
0 runs0 likes0 downloads0 reach0 impact
4392 instances - 7 features - classes - 0 missing values
Context Content A few years ago, the United States District Court of Houston had a case that arises under Title VII of the Civil Rights Act of 1964, 42 U.S.C. 2000e et seq. The plaintiffs in this case…
0 runs0 likes0 downloads0 reach0 impact
261 instances - 10 features - classes - 0 missing values
Context It is important that credit card companies are able to recognize fraudulent credit card transactions so that customers are not charged for items that they did not purchase. Content The…
0 runs0 likes0 downloads0 reach0 impact
284807 instances - 30 features - classes - 0 missing values
Description The data set is provided as a CSV file, which offers the following resources that can be used as inputs for model building: A collection of website URLs for 11001 websites. Each sample has 15…
0 runs0 likes0 downloads0 reach0 impact
11000 instances - 15 features - classes - 0 missing values
Having just moved to Boston last fall, I wanted to know whether the severe winter weather I experienced was normal based on historical data, or if there are any patterns to when Boston gets heavy…
0 runs0 likes0 downloads0 reach0 impact
3749 instances - 24 features - classes - 0 missing values
Source: Charles Gaydon This data only contains 5 variables of Productcode, Warehouse, ProductCategory, Date, Order_demand I showed that it is possible, with trivial models, to lower the mean average…
0 runs0 likes0 downloads0 reach0 impact
1048575 instances - 5 features - classes - 11239 missing values
Context This dataset is created for the prediction of future New York Housing Price based on the past 17 years of record. Content Please check the details under the column description.…
0 runs0 likes0 downloads0 reach0 impact
1600202 instances - 10 features - classes - 40496 missing values
Context The World Health Organization (WHO) declared the 2019-20 coronavirus outbreak a pandemic and a Public Health Emergency of International Concern (PHEIC). Evidence of local transmission of the…
0 runs0 likes0 downloads0 reach0 impact
1010 instances - 7 features - classes - 8 missing values
Context Inspired by the following dataset, we have a collection of data on the first 15 minutes of about 50000 Diamond ranked League of Legends matches scraped using Riot's API. Can you predict their…
0 runs0 likes0 downloads0 reach0 impact
48651 instances - 19 features - classes - 0 missing values
Context This data set contains Amazon stock values dated between 1 October 2019 and 14 July 2020. Content The data set contains 7 different columns that include Date, the opening…
0 runs0 likes0 downloads0 reach0 impact
202 instances - 8 features - classes - 13 missing values
Context Interesting data about accidents with cars on railroad crossings. Content This data covers the period from 1975 to 2016. It has a lot of info about every single accident like time, date,…
0 runs0 likes0 downloads0 reach0 impact
229665 instances - 24 features - classes - 151138 missing values
Context This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle…
0 runs0 likes0 downloads0 reach0 impact
100000 instances - 28 features - classes - 1985 missing values
Context Facebook is a company that literally every kid is aware of. It's a household name. People from various age groups are there on this social media website. It has helped many in connecting with…
0 runs0 likes0 downloads0 reach0 impact
2076 instances - 7 features - classes - 0 missing values
Context Amazon has become a household name now and has been around for quite some time. It is one of the popular FAANG companies and a dream company for many. Today almost anything that you need…
0 runs0 likes0 downloads0 reach0 impact
5852 instances - 7 features - classes - 0 missing values
Context This data set contains measles vaccination rate data for 46,412 schools in 32 states across the US. Content Vaccination rates are for the 2017-18 school year for the following states:…
0 runs0 likes0 downloads0 reach0 impact
66113 instances - 15 features - classes - 322786 missing values
Context This dataset contains information about HDFC Bank equity shares. Content The dataset contains a total of 7 columns: Date 5151 non-null datetime64[ns] Open 5082 non-null…
0 runs0 likes0 downloads0 reach0 impact
489 instances - 14 features - classes - 0 missing values
Context Microstrip Antennas are low-profile antennas applied in high-performance aircraft, spacecraft, satellite and missile applications, where size, weight, performance, ease of installation and…
0 runs0 likes0 downloads0 reach0 impact
572 instances - 13 features - classes - 63 missing values
Context It's always interesting to analyse what's going on in the news, but there's no good data available in the Indian context on Kaggle, so I created one. Content Apart from the main content of the…
0 runs0 likes0 downloads0 reach0 impact
1568 instances - 8 features - classes - 33 missing values
Context Using TA-Lib : Technical Analysis Library. Backtest on the SPY Index data, using support and resistance indicators or any other indicator. Content Data contains daily SPY Index: Date Open High…
0 runs0 likes0 downloads0 reach0 impact
6126 instances - 7 features - classes - 0 missing values
For a more comprehensive dataset with many more features check out the "Yacht/Motorboat Pricing Data (10,000+ listings)" dataset. Link below:…
0 runs0 likes0 downloads0 reach0 impact
2530 instances - 18 features - classes - 11212 missing values
Context I was searching for a master's degree program in data science when I found this awesome website, mastersportal, so I just scraped it to take my time analysing all master programs available…
0 runs0 likes0 downloads0 reach0 impact
60425 instances - 23 features - classes - 178594 missing values
Context This dataset was a part of the assignment of my coursework. Content The dataset contains 90+ columns describing different aspects of all countries like GDP, Population, Electricity-consumption…
0 runs0 likes0 downloads0 reach0 impact
197 instances - 80 features - classes - 0 missing values
Context Includes data on confirmed cases, deaths, hospitalizations, and testing, as well as other variables of potential interest. Content As of 26 January 2021, the columns are: isocode, continent,…
0 runs0 likes0 downloads0 reach0 impact
63381 instances - 59 features - classes - 1508423 missing values
Context This is home value data for the hot Nashville market. Content There are 56,000+ rows altogether. However, I'm missing home detail data for about half. So if anyone wants to track that down…
0 runs0 likes0 downloads0 reach0 impact
56636 instances - 31 features - classes - 648773 missing values
Context Hourly weather data for New York City. Extracted from online web sources. The following data set is cleaned for the purpose of NYC Taxi ETA calculation. Content We have features such as Date,…
0 runs0 likes0 downloads0 reach0 impact
5141 instances - 7 features - classes - 4 missing values
Context This is a forked subset of the Uber Pickups in New York City, enriched with weather, borough, and holidays information. Content I created this dataset for a personal project of exploring and…
0 runs0 likes0 downloads0 reach0 impact
29101 instances - 13 features - classes - 3043 missing values
Context Minneapolis air quality survey results Content Contained in the file are Minneapolis air quality survey results obtained between November 2013 and August 2014. The data set was obtained from…
0 runs0 likes0 downloads0 reach0 impact
4790 instances - 18 features - 0 classes - 2999 missing values
Context Since Myanmar is one of the developing countries, a lot of factories were set up and the number of cars increased speedily during the previous years. Therefore, Myanmar's air quality was also…
0 runs0 likes0 downloads0 reach0 impact
5122 instances - 16 features - classes - 0 missing values
Context This dataset contains information from all 400 Pokemon in generation eight. Content No.: Pokedex number; Name: Pokemon name in English; Ability1: Pokemon ability; Ability2: Pokemon second ability…
0 runs0 likes0 downloads0 reach0 impact
400 instances - 19 features - classes - 369 missing values
A personal loan is an unsecured loan; therefore, it is vital to assess the risk of the customers by checking their creditworthiness. This must be done to prevent loan defaults. The objective is…
0 runs0 likes0 downloads0 reach0 impact
119528 instances - 32 features - classes - 987539 missing values
This dataset is now updated annually here. Context This dataset contains the salary, pay rate, and total compensation of every New York City employee. In this dataset this information is provided for…
0 runs0 likes0 downloads0 reach0 impact
2194488 instances - 16 features - classes - 1398151 missing values
Context Delinquency is a condition that arises when an activity or situation does not occur at its scheduled (or expected) date i.e., it occurs later than expected. Content Many donors, experts, and…
0 runs0 likes0 downloads0 reach0 impact
209593 instances - 35 features - classes - 0 missing values
This dataset contains estimates of the socioeconomic status (SES) position of each of 149 countries covering the period 1880-2010. Measures of SES, which are in decades, allow for a 130 year…
0 runs0 likes0 downloads0 reach0 impact
1036 instances - 10 features - classes - 0 missing values
Context The original dataset contains 1000 entries with 20 categorical/symbolic attributes prepared by Prof. Hofmann. In this dataset, each entry represents a person who takes credit from a bank. Each…
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 10 features - classes - 577 missing values
Context The subject matter of this dataset explores Tesla's stock price from its initial public offering (IPO) to yesterday. Content Within the dataset one will encounter the following: The date -…
0 runs0 likes0 downloads0 reach0 impact
1692 instances - 7 features - classes - 0 missing values
Context Space Apps Moscow was held on April 29th-30th. Thank you to the 175 people who joined the International Space Apps Challenge at this location! Content The dataset contains such columns as:…
0 runs0 likes0 downloads0 reach0 impact
32686 instances - 11 features - classes - 0 missing values
Background When is my university campus gym least crowded, so I know when to work out? We measured how many people were in this gym once every 10 minutes over the last year. We want to be able to…
0 runs0 likes0 downloads0 reach0 impact
62184 instances - 11 features - classes - 0 missing values
Context Emotion detection is an interesting blend of psychology and technology. This technology helps to build companion robots. These robots can be friendly and have the ability to recognize users…
0 runs0 likes0 downloads0 reach0 impact
1104 instances - 8 features - classes - 0 missing values
Context The stock prices dataset of a ticker is a good start to slice and dice and good for forecasting of the stock prices. The GOOG ticker data is used. Content The dataset comprises the below…
0 runs0 likes0 downloads0 reach0 impact
249 instances - 6 features - classes - 0 missing values
Context Content What's inside is more than just rows and columns. Make it easy for others to get started by describing how you acquired the data and what time period it represents, too.…
0 runs0 likes0 downloads0 reach0 impact
3923 instances - 12 features - classes - 2786 missing values
Context I wanted a highly imbalanced dataset to share with others. It has the perfect one for us. Imbalanced data typically refers to a classification problem where the number of observations per…
0 runs0 likes0 downloads0 reach0 impact
9578 instances - 14 features - classes - 0 missing values
Context This work focuses on creating a data set of Pandas Q/A from StackOverflow. Presently, there are more than 90k questions available on StackOverflow which have been asked under Pandas…
0 runs0 likes0 downloads0 reach0 impact
87241 instances - 16 features - classes - 472864 missing values
Context I scraped all of the currently available Urban Dictionary pages (611) on 3/26/17 Content word - the slang term added to Urban Dictionary; definition - the definition of said term; author - the…
0 runs0 likes0 downloads0 reach0 impact
4272 instances - 6 features - classes - 0 missing values
Apple stock price of each work day since January 1st 2021. Contains highest price, lowest price, open, close, volume and adjusted close
0 runs0 likes0 downloads0 reach0 impact
348 instances - 6 features - 0 classes - 0 missing values
MEG data from auditory stimulation experiment using 305 sensors. The design matrix/forward operator is `data[:, :7498]`. The measurements for left stimulation are `data[:, 7498:7583]`. The…
0 runs0 likes0 downloads0 reach0 impact
305 instances - 7498 features - classes - 0 missing values
MEG data from auditory stimulation experiment using 305 sensors. The design matrix/forward operator is `data[:, :7498]`. The measurements for left stimulation are `data[:, 7498:7583]`. The…
0 runs0 likes0 downloads0 reach0 impact
305 instances - 7668 features - classes - 0 missing values
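The column layout quoted in the two MEG entries above (a design matrix block followed by a block of left-stimulation measurements) can be illustrated with a minimal sketch. It is an illustration only: the helper name `split_columns` and the toy data are hypothetical, the standard library stands in for the NumPy-style slices in the description, and only the boundaries 7498 and 7583 come from the listing.

```python
def split_columns(data, n_design, n_left):
    """Split every row into a design block and a left-stimulation block.

    Mirrors the NumPy-style slices quoted in the description:
      design = data[:, :n_design]
      left   = data[:, n_design:n_design + n_left]
    """
    design = [row[:n_design] for row in data]
    left = [row[n_design:n_design + n_left] for row in data]
    return design, left


# Toy stand-in (hypothetical values): 3 rows instead of 305 sensors,
# 4 design columns instead of 7498, 2 left columns instead of 7583 - 7498.
toy = [[r * 10 + c for c in range(6)] for r in range(3)]
design, left = split_columns(toy, n_design=4, n_left=2)

# Width of the left-stimulation block implied by the quoted slice 7498:7583.
left_width = 7583 - 7498
```

With the real data, the same call would use `n_design=7498` and `n_left=left_width`; the remaining columns are not described in the truncated text, so they are left out here.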
The data comprises the brand, type and model. For each car we have information about the cubic capacity (in cm3), the maximal power of engine (in kW), the maximal torque (in Nm), the number of seats,…
0 runs0 likes0 downloads0 reach0 impact
475 instances - 13 features - classes - 24 missing values
will delete again
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 3 classes - 0 missing values
This data set measures the running time of a matrix-matrix product $A \times B = C$, where all matrices have size 2048 x 2048, using a parameterizable *SGEMM GPU* (Single Precision General Matrix…
0 runs0 likes0 downloads0 reach0 impact
241600 instances - 15 features - 0 classes - 0 missing values
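As a rough illustration of what "measuring the running time of a matrix-matrix product" means in the entry above, here is a minimal CPU sketch. It is not the parameterizable SGEMM GPU kernel the dataset was generated with; the function names, the toy size of 32 (instead of 2048), and the repeat count are all assumptions made for the example.

```python
import time


def matmul(a, b):
    """Naive product C = A x B for matrices stored as lists of rows."""
    b_cols = list(zip(*b))  # transpose B once so columns are easy to iterate
    return [[sum(x * y for x, y in zip(row, col)) for col in b_cols] for row in a]


def time_matmul(n, repeats=3):
    """Time an n x n product several times; return the result and timings in seconds."""
    a = [[float(i + j) for j in range(n)] for i in range(n)]
    b = [[float(i == j) for j in range(n)] for i in range(n)]  # identity matrix
    timings = []
    c = None
    for _ in range(repeats):
        start = time.perf_counter()
        c = matmul(a, b)
        timings.append(time.perf_counter() - start)
    return c, timings


# Toy size; the dataset itself uses 2048 x 2048 matrices on a GPU.
c, timings = time_matmul(32)
```

Multiplying by the identity keeps the result easy to check (C equals A), and repeating the measurement gives a small sample of run times rather than a single noisy value.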
The data relates to an inverse dynamics problem for a seven degrees-of-freedom SARCOS anthropomorphic robot arm. The task is to map from a 21-dimensional input space (7 joint positions, 7 joint…
0 runs0 likes0 downloads0 reach0 impact
44484 instances - 22 features - 0 classes - 0 missing values
This dataset has been compiled from public sources. The dataset consists of daily temperatures and precipitation from 13 Canadian centres. Precipitation is either rain or snow (likely snow in winter…
0 runs0 likes0 downloads0 reach0 impact
29221 instances - 27 features - classes - 53518 missing values
Context From World Health Organization - On 31 December 2019, WHO was alerted to several cases of pneumonia in Wuhan City, Hubei Province of China. The virus did not match any other known virus. This…
0 runs0 likes0 downloads0 reach0 impact
36137 instances - 36 features - classes - 314500 missing values