OpenML
Filter results by:
This is the full version of the KDD Cup 2009 dataset Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs0 likes0 downloads0 reach0 impact
50000 instances - 15001 features - 2 classes - 25108569 missing values
This is the full version of the KDD Cup 2009 dataset Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs0 likes0 downloads0 reach0 impact
50000 instances - 15001 features - 2 classes - 25108569 missing values
This is the full version of the KDD Cup 2009 dataset Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs0 likes0 downloads0 reach0 impact
50000 instances - 15001 features - 2 classes - 25108569 missing values
DATA
0 runs0 likes0 downloads0 reach0 impact
3119345 instances - 88 features - classes - 24532528 missing values
This is the full version of the KDD Cup 2009 dataset Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs0 likes0 downloads0 reach1 impact
50000 instances - 14892 features - 2 classes - 19658569 missing values
Public procurement data for the European Economic Area, Switzerland, and the Macedonia. 2015
0 runs0 likes0 downloads0 reach0 impact
565163 instances - 75 features - 0 classes - 15247061 missing values
This is the full version of the KDD Cup 2009 dataset Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs0 likes0 downloads0 reach0 impact
50000 instances - 15001 features - 2 classes - 14616450 missing values
This is the full version of the KDD Cup 2009 dataset Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs0 likes0 downloads0 reach0 impact
50000 instances - 15001 features - 2 classes - 14616450 missing values
This is the full version of the KDD Cup 2009 dataset Customer Relationship Management (CRM) is a key element of modern marketing strategies. The KDD Cup 2009 offers the opportunity to work on large…
0 runs0 likes0 downloads0 reach0 impact
50000 instances - 15001 features - 2 classes - 14616450 missing values
Domain dataset
0 runs0 likes0 downloads0 reach0 impact
1637 instances - 9839 features - 3 classes - 13231887 missing values
General Description 2015-current: greater than $200.00. The Commission categorizes contributions from individuals using the calendar year-to-date amount for political action committee (PAC) and party…
0 runs0 likes0 downloads0 reach0 impact
3348209 instances - 21 features - 0 classes - 10786577 missing values
Context and Content The COVID-19 case surveillance system database includes individual-level data reported to U.S. states and autonomous reporting entities, including New York City and the District of…
0 runs0 likes0 downloads0 reach0 impact
8405079 instances - 11 features - classes - 9543526 missing values
This dataset reflects incidents of crime in the City of Los Angeles dating back to 2010. This data is transcribed from original crime reports that are typed on paper and therefore there may be some…
0 runs0 likes0 downloads0 reach0 impact
1692056 instances - 26 features - classes - 9275842 missing values
Home Credit Default Risk Main Table > Huang, X., Khetan, A., Cvitkovic, M., & Karnin, Z. (2020). > Tabtransformer: Tabular data modeling using contextual embeddings. > arXiv preprint…
0 runs0 likes0 downloads0 reach0 impact
307511 instances - 121 features - 2 classes - 9152465 missing values
The KDD Cup 2009 offers the opportunity to work on large marketing databases from the French Telecom company Orange to predict the propensity of customers to switch provider (churn). Churn (wikipedia…
10985 runs0 likes0 downloads0 reach0 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) KDD Cup 2009 http://www.kddcup-orange.com Converted to ARFF format by TunedIT Customer Relationship Management (CRM) is a key element…
11305 runs0 likes0 downloads0 reach0 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
Datasets from ACM KDD Cup (http://www.sigkdd.org/kddcup/index.php) KDD Cup 2009 http://www.kddcup-orange.com Converted to ARFF format by TunedIT Customer Relationship Management (CRM) is a key element…
223 runs0 likes18 downloads18 reach19 impact
50000 instances - 231 features - 2 classes - 8024152 missing values
This dataset contains traffic violation information from all electronic traffic violations issued in the County. Any information that can be used to uniquely identify the vehicle, the vehicle owner or…
0 runs0 likes0 downloads0 reach0 impact
1578154 instances - 43 features - 4 classes - 8006541 missing values
This dataset reflects incidents of crime in the City of Los Angeles dating back to 2010. This data is transcribed from original crime reports that are typed on paper and therefore there may be some…
0 runs0 likes0 downloads0 reach0 impact
1468825 instances - 26 features - 0 classes - 7881776 missing values
Experiment data obtained by running random configurations of xgboost through mlr on 118 different classification tasks from openml. Parameter descriptions:…
0 runs0 likes0 downloads0 reach0 impact
2955210 instances - 21 features - classes - 7051006 missing values
Bleichenbacher Padding Attack: Dataset created on 2023-04-28 with server OpenSSL097b Attribute Names: TLS0:tcp.srcport TLS0:tcp.dstport TLS0:tcp.port TLS0:tcp.stream TLS0:tcp.len TLS0:tcp.seq…
0 runs0 likes0 downloads0 reach0 impact
219915 instances - 125 features - 11 classes - 6597420 missing values
Bleichenbacher Padding Attack: Dataset created on 2023-04-28 with server OpenSSL097a Attribute Names: TLS0:tcp.srcport TLS0:tcp.dstport TLS0:tcp.port TLS0:tcp.stream TLS0:tcp.len TLS0:tcp.seq…
0 runs0 likes0 downloads0 reach0 impact
219913 instances - 125 features - 11 classes - 6597360 missing values
Context Real Estate inventory of listings from 2012-2017 Content Includes data for all Real Estate listings in the US, such as, active listings, prices, days on market, price changes, and pending…
0 runs0 likes0 downloads0 reach0 impact
974066 instances - 34 features - classes - 5792129 missing values
Dataset KDD98 challenge: https://kdd.ics.uci.edu/databases/kddcup98/kddcup98.html The goal is to estimate the return from a direct mailing in order to maximize donation profits. This dataset…
0 runs0 likes0 downloads0 reach0 impact
191260 instances - 479 features - 0 classes - 5587563 missing values
The pandemic context brings new challenges to cities. This fantastic resource was created by Google with aggregated, anonymized sets of data from users who have turned on the Location History setting…
0 runs0 likes0 downloads0 reach0 impact
1048575 instances - 14 features - classes - 5573689 missing values
This is the datasets from the Kaggle Higgs Boson Machine Learning Challenge 2014. The data was downloaded from the [CERN website](http://opendata.cern.ch/record/328), which also hosts the…
0 runs0 likes0 downloads0 reach0 impact
818238 instances - 31 features - 2 classes - 5168486 missing values
Coronavirus Country Profiles We built 207 country profiles which allow you to explore the statistics on the coronavirus pandemic for every country in the world. In a fast-evolving pandemic it is not a…
0 runs0 likes0 downloads0 reach0 impact
170646 instances - 66 features - classes - 5082293 missing values
This is the datasets from the Kaggle Higgs Boson Machine Learning Challenge 2014. The data was downloaded from the [CERN website](http://opendata.cern.ch/record/328), which also hosts the…
0 runs0 likes0 downloads0 reach0 impact
800000 instances - 31 features - 2 classes - 5053446 missing values
Context This is historical data on cryptocurrency tradings for the period from 2016-01-01 to 2021-02-21. If you enjoy this dataset please upvote so I can see it is popular and I need to update it.…
0 runs1 likes0 downloads1 reach0 impact
2382643 instances - 17 features - classes - 4862194 missing values
Context The International Chess Federation (FIDE) governs international chess competition. FIDE used Elo rating system for calculating the relative skill levels of players. Content The dataset…
0 runs0 likes0 downloads0 reach0 impact
977687 instances - 10 features - classes - 3994218 missing values
()[]
0 runs0 likes0 downloads0 reach0 impact
2845342 instances - 46 features - classes - 3414349 missing values
Description This is a countrywide car accident dataset, which covers 49 states of the USA. The accident data are collected from February 2016 to Dec 2020, using two APIs that provide streaming traffic…
0 runs0 likes0 downloads0 reach0 impact
2845342 instances - 46 features - classes - 3414349 missing values
Context Getting access to high-quality historical stock market data can be very expensive and/or complicated; parsing SEC 10-Q filings direct from the SEC EDGAR is difficult due to the varying…
0 runs1 likes0 downloads1 reach0 impact
101787 instances - 45 features - classes - 2857964 missing values
This dataset combines records from the MLCQ dataset with metrics extracted using the PMD Tool and the Understand tool, to determine whether a file contains code smells. Please note that the records…
0 runs0 likes0 downloads0 reach0 impact
86467 instances - 67 features - 0 classes - 2852906 missing values
This dataset combines records from the MLCQ dataset with metrics extracted using the PMD Tool and the Understand tool, to determine whether a file contains code smells. Please note that the records…
0 runs0 likes0 downloads0 reach0 impact
83943 instances - 67 features - 0 classes - 2801627 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
0 runs0 likes1 downloads1 reach18 impact
425240 instances - 79 features - 2 classes - 2734000 missing values
Dataset KDD98 challenge: https://kdd.ics.uci.edu/databases/kddcup98/kddcup98.html The goal is to estimate the return from a direct mailing in order to maximize donation profits. This dataset…
0 runs0 likes1 downloads1 reach9 impact
82318 instances - 478 features - 2 classes - 2399311 missing values
Bleichenbacher Padding Attack: Dataset created on 2021-07-23 with server OpenSSL-1.1.1k Attribute Names: TLS0:tcp.srcport TLS0:tcp.dstport TLS0:tcp.port TLS0:tcp.stream TLS0:tcp.len TLS0:tcp.seq…
0 runs0 likes0 downloads0 reach0 impact
49977 instances - 117 features - 11 classes - 2196060 missing values
Data reported to the police about the circumstances of personal injury road accidents in Great Britain from 1979, and the maker and model information of vehicles involved in the respective accident.…
0 runs0 likes1 downloads1 reach1 impact
363243 instances - 67 features - 3 classes - 2181757 missing values
UserID
0 runs0 likes0 downloads0 reach0 impact
1974675 instances - 10 features - classes - 1974675 missing values
web services evaluations in this table
0 runs0 likes0 downloads0 reach0 impact
1974675 instances - 10 features - classes - 1974675 missing values
PASS is a large-scale image dataset that does not include any humans and which can be used for high-quality pretraining while significantly reducing privacy concerns. Upload by OpenML team.
0 runs0 likes0 downloads0 reach0 impact
1439588 instances - 7 features - 94137 classes - 1775490 missing values
Bleichenbacher Padding Attack: Dataset created on 2023-12-05 with server facebook Attribute Names: CKE0:tcp.srcport CKE0:tcp.dstport CKE0:tcp.port CKE0:tcp.stream CKE0:tcp.len CKE0:tcp.seq…
0 runs0 likes0 downloads0 reach0 impact
19487 instances - 181 features - 11 classes - 1750620 missing values
Context This dataset deals with pollution in the U.S. Pollution in the U.S. has been well documented by the U.S. EPA but it is a pain to download all the data and arrange them in a format that…
0 runs0 likes0 downloads0 reach0 impact
1746661 instances - 29 features - classes - 1746230 missing values
Context Craigslist is the world's largest collection of used vehicles for sale, yet it's very difficult to collect all of them in the same place. I built a scraper for a school project and expanded…
0 runs0 likes0 downloads0 reach0 impact
426880 instances - 25 features - classes - 1655336 missing values
Context Craigslist is the world's largest collection of used vehicles for sale, yet it's very difficult to collect all of them in the same place. I built a scraper for a school project and expanded…
0 runs0 likes0 downloads0 reach0 impact
426880 instances - 25 features - classes - 1655336 missing values
Context All credit of this database goes to Tim Sevenhuysen of OraclesElixir.com. Im just uploading it here because I want to see what you guys do with this dataset before Worlds! Im super hyped!…
0 runs0 likes0 downloads0 reach0 impact
67980 instances - 103 features - classes - 1527593 missing values
Context Includes data on confirmed cases, deaths, hospitalizations, and testing, as well as other variables of potential interest. Content As of 26 January 2021, the columns are: isocode, continent,…
0 runs0 likes0 downloads0 reach0 impact
63381 instances - 59 features - classes - 1508423 missing values
This dataset is now updated annually here. Context This dataset contains the salary, pay rate, and total compensation of every New York City employee. In this dataset this information is provided for…
0 runs0 likes0 downloads0 reach0 impact
2194488 instances - 16 features - classes - 1398151 missing values
Context Google PlayStore App analytics. (1.1 Million + App Data) Source: https://github.com/gauthamp10/Google-Playstore-Dataset Content I've collected the data with the help of Python and Scrapy…
0 runs0 likes0 downloads0 reach0 impact
2312944 instances - 24 features - classes - 1355506 missing values
# Achieved Frames per Second (FPS) in video games This dataset contains FPS measurement of video games executed on computers. Each row of the dataset describes the outcome of FPS measurement (outcome…
0 runs0 likes0 downloads0 reach0 impact
425833 instances - 45 features - 0 classes - 1299988 missing values
Bleichenbacher Padding Attack: Dataset created on 2023-12-13 with server netscalergcm Attribute Names: TLS0:tcp.srcport TLS0:tcp.dstport TLS0:tcp.port TLS0:tcp.stream TLS0:tcp.len TLS0:tcp.seq…
0 runs0 likes0 downloads0 reach0 impact
7678 instances - 253 features - 11 classes - 1195562 missing values
Bleichenbacher Padding Attack: Dataset created on 2023-12-04 with server cisco Attribute Names: CCS0:tcp.srcport CCS0:tcp.dstport CCS0:tcp.port CCS0:tcp.stream CCS0:tcp.len CCS0:tcp.seq…
0 runs0 likes0 downloads0 reach0 impact
7714 instances - 253 features - 11 classes - 1157656 missing values
This is the dataset used for the 2016 IDA Industrial Challenge, courtesy of Scania. For a full description, see http://archive.ics.uci.edu/ml/datasets/IDA2016Challenge . This dataset contains both the…
9 runs0 likes2 downloads2 reach19 impact
76000 instances - 171 features - 2 classes - 1078695 missing values
Personal Loan product is an unsecured loan therefore it is vital to assess the risk of the customers by checking their credit worthiness. This must be done to prevent loan defaults. The objective is…
0 runs0 likes0 downloads0 reach0 impact
119528 instances - 32 features - classes - 987539 missing values
A Vicon motion capture camera system was used to record 12 users performing 5 hand postures with markers attached to a left-handed glove. A rigid pattern of markers on the back of the glove was used…
0 runs0 likes0 downloads0 reach0 impact
78096 instances - 38 features - classes - 974700 missing values
Overview Identification COUNTRY Rwanda TITLE Integrated Household Living Conditions Survey 2010-2011 TRANSLATED TITLE Enqute Intgrale sur les conditions de vie des mnages 2010-2011 STUDY TYPE…
0 runs0 likes0 downloads0 reach0 impact
78432 instances - 23 features - classes - 907612 missing values
Data reported to the police about the circumstances of personal injury road accidents in Great Britain from 1979, and the maker and model information of vehicles involved in the respective accident
0 runs0 likes0 downloads0 reach0 impact
363206 instances - 66 features - 0 classes - 876555 missing values
P2P Lending I concatenated historical loans from both Prosper and Lending Club 2013 - 2018. Currently only the summary of the loan (terms, origination date, loan amount, status, etc) are up but…
0 runs0 likes0 downloads0 reach0 impact
2875146 instances - 18 features - classes - 863078 missing values
Training dataset of the 'Porto Seguros Safe Driver Prediction' Kaggle challenge [https://www.kaggle.com/c/porto-seguro-safe-driver-prediction]. The goal was to predict whether a driver will file an…
2 runs0 likes0 downloads0 reach12 impact
595212 instances - 38 features - 2 classes - 846458 missing values
Training dataset of the 'Porto Seguros Safe Driver Prediction' Kaggle challenge [https://www.kaggle.com/c/porto-seguro-safe-driver-prediction]. The goal was to predict whether a driver will file an…
0 runs0 likes0 downloads0 reach2 impact
595212 instances - 58 features - 2 classes - 846458 missing values
Context Infrastructure-as-code (IaC) is the DevOps strategy that allows management and provisioning of infrastructure through the definition of machine-readable files and automation around them,…
0 runs0 likes0 downloads0 reach0 impact
227272 instances - 113 features - 0 classes - 757758 missing values
## **Meta-Album OmniPrint-MD-mix Dataset (Mini)** OmniPrint-MD-mix dataset consists of 28 240 images (128x128, RGB) from 706 categories. The images are synthesized with OmniPrint, and no further…
0 runs0 likes0 downloads0 reach3 impact
28240 instances - 69 features - 0 classes - 665053 missing values
A copy of MD_MIX_Mini dataset from Meta album Set0
0 runs0 likes0 downloads0 reach0 impact
28240 instances - 69 features - 706 classes - 665053 missing values
Experiment data obtained by running random configurations of an SVM through mlr on 106 different classification tasks from openml.
0 runs0 likes0 downloads0 reach0 impact
540576 instances - 15 features - classes - 658962 missing values
Context This is home value data for the hot Nashville market. Content There are 56,000+ rows altogether. However, I'm missing home detail data for about half. So if anyone wants to track that down…
0 runs0 likes0 downloads0 reach0 impact
56636 instances - 31 features - classes - 648773 missing values
Context Huge Harry Potter fan. Wanted to collect fan-fiction data to make a dashboard and visualize it. Its in the works. Content I scraped this data from https://www.fanfiction.net/book/Harry-Potter/…
0 runs0 likes0 downloads0 reach0 impact
648493 instances - 16 features - classes - 647586 missing values
Bleichenbacher Padding Attack: Dataset created on 2023-12-13 with server panos Attribute Names: CCS0:tcp.srcport CCS0:tcp.dstport CCS0:tcp.port CCS0:tcp.stream CCS0:tcp.len CCS0:tcp.seq…
0 runs0 likes0 downloads0 reach0 impact
7869 instances - 197 features - 11 classes - 612770 missing values
bases-de-donnees-annuelles-des-accidents-corporels-de-la-circulation-routiere-annees-de-2005-a-2019
0 runs0 likes0 downloads0 reach0 impact
132977 instances - 55 features - 0 classes - 550521 missing values
Context UNESCO: Complete set of demographic and socioeconomic variables. UIS statistics contact: uis.datarequestsunesco.org Sources: United Nations, Department of Economic and Social Affairs,…
0 runs0 likes0 downloads0 reach0 impact
270038 instances - 9 features - classes - 538010 missing values
## **Meta-Album Plankton Dataset (Extended)** The Plankton dataset is created by researchers at the Woods Hole Oceanographic Institution (https://www.whoi.edu/). Imaging FlowCytobot (IFCB) was used…
0 runs0 likes0 downloads0 reach1 impact
473273 instances - 3 features - 102 classes - 473273 missing values
Context This works focuses upon creating a data set on Pandas Q/A over StackOverflow. Presently, there are more than 90k+ questions available on StackOverflow which have been asked under Pandas…
0 runs0 likes0 downloads0 reach0 impact
87241 instances - 16 features - classes - 472864 missing values
Product listing data submitted to the U.S. FDA for all unfinished, unapproved drugs.
0 runs0 likes0 downloads0 reach0 impact
120215 instances - 20 features - 7 classes - 443305 missing values
This version has feature names based on https://www2.1010data.com/documentationcenter/beta/Tutorials/MachineLearningExamples/CensusIncomeDataSet.html Missing data is also properly encoded in this…
0 runs0 likes1 downloads1 reach0 impact
199523 instances - 42 features - 2 classes - 415717 missing values
This data is derived from the 2012 KDD Cup. The data is subsampled to 1% of the original number of instances, downsampling the majority class (click=0) so that the target feature is reasonably…
0 runs0 likes0 downloads0 reach0 impact
798964 instances - 10 features - 3 classes - 399482 missing values
A csv file with 80,000+ tweets from January 6th, 2021 -- the day of the capitol hill riots. Made using the Twitter Developer API + Tweepy. Nowhere close to the size of the Parler data dumps, but…
0 runs0 likes0 downloads0 reach0 impact
82309 instances - 14 features - classes - 392323 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The river flow datasets concern the prediction of river network flows for 48 h in the future at…
0 runs0 likes0 downloads0 reach0 impact
9125 instances - 584 features - classes - 356160 missing values
The collection consists of six data sets containing telemetry data of the Mars Express Spacecraft (MEX), a spacecraft orbiting Mars and operated by the European Space Agency. The data, in terms of…
0 runs0 likes0 downloads0 reach0 impact
65952 instances - 518 features - classes - 331128 missing values
Traffic violations followed the invention of the automobile: the first traffic ticket in the United States was allegedly given to a New York City cab driver on May 20, 1899, for going at the breakneck…
0 runs0 likes0 downloads0 reach0 impact
1018634 instances - 34 features - classes - 328559 missing values
Context This data set contains measles vaccination rate data for 46,412 schools in 32 states across the US. Content Vaccination rates are for the 2017-201818 school year for the following states:…
0 runs0 likes0 downloads0 reach0 impact
66113 instances - 15 features - classes - 322786 missing values
Context From World Health Organization - On 31 December 2019, WHO was alerted to several cases of pneumonia in Wuhan City, Hubei Province of China. The virus did not match any other known virus. This…
0 runs0 likes0 downloads0 reach0 impact
36137 instances - 36 features - classes - 314500 missing values
Bleichenbacher Timing Attack: 20 micro seconds dataset created on 2022-09-19 Attribute Descriptions: CCS0:tcp.srcport: TCP Source Port of the first Change Cipher Spec TCP Acknowledgement…
0 runs0 likes0 downloads0 reach0 impact
9996 instances - 155 features - 11 classes - 299850 missing values
Context The official Goodread's API limits retrievable data, so I decided to scrape the actual HTTP pages and grab additional details on each book. Content Books are scraped from a list titles the…
0 runs0 likes0 downloads0 reach0 impact
52199 instances - 31 features - classes - 285951 missing values
Anonymized data of dating profiles from OkCupid
0 runs0 likes0 downloads0 reach0 impact
59946 instances - 31 features - 0 classes - 273249 missing values
Context This Online Retail II data set contains all the transactions occurring for a UK-based and registered, non-store online retail between 01/12/2009 and 09/12/2011.The company mainly sells unique…
0 runs0 likes0 downloads0 reach0 impact
1067371 instances - 8 features - classes - 247481 missing values
Multi-omics bladder cancer dataset containing incomplete modality samples from the TCGA project.
0 runs0 likes0 downloads0 reach0 impact
80 instances - 88518 features - classes - 235683 missing values
Context Every year Kaggle conducts an industry-wide survey that presents a truly comprehensive view of the state of data science and machine learning. This dataset combines the data from the past 4…
0 runs0 likes0 downloads0 reach0 impact
80327 instances - 12 features - classes - 226439 missing values
Context Craigslist is the world's largest collection of privately sold housing options, yet it's very difficult to collect all of them in the same place. I built this dataset as a means in by which to…
0 runs0 likes0 downloads0 reach0 impact
384977 instances - 22 features - classes - 223551 missing values
### Context Since its inception in 2008, Airbnb has disrupted the traditional hospitality industry as more travellers decide to use Airbnb as their primary means of accommodation. Airbnb offers…
0 runs0 likes0 downloads0 reach0 impact
226030 instances - 17 features - classes - 213880 missing values
Sanitized and anonymized Cargo 2000 (C2K) airfreight tracking and tracing events, covering five months of business execution (3,942 process instances, 7,932 transport legs, 56,082 activities). ###…
0 runs0 likes0 downloads0 reach0 impact
3943 instances - 98 features - classes - 210284 missing values
Context Thinking of Natural Language Processing as a beginner!! The dataset has been about the wine comments or reviews that has been given by various wine tasters. The concept was to use text…
0 runs0 likes0 downloads0 reach0 impact
129971 instances - 14 features - classes - 204754 missing values
130k wine reviews with variety, location, winery, price, and description. Downloaded from Kaggle [https://www.kaggle.com/zynicide/wine-reviews/home] on 29.10.2018. The original data was scraped from…
0 runs0 likes0 downloads0 reach0 impact
129971 instances - 13 features - 0 classes - 204752 missing values
COVID-19 Stats Trends Context This dataset seeks to provide insights into what has changed due to policies aimed at combating COVID-19 and evaluate the changes in community activities and its relation…
0 runs0 likes0 downloads0 reach0 impact
53377 instances - 22 features - classes - 201926 missing values
Author: Marius Lindauer Date: 27.02.2014 These data set was generated for a publication about claspfolio 2.0, i.e., an algorithm selector for ASP. The algorithm portfolio of clasp (2.1.4)…
0 runs0 likes0 downloads0 reach0 impact
14234 instances - 143 features - 0 classes - 200838 missing values
Interest and Motivation This dataset belongs to the MeToo movement on Twitter. This movement was against the sexual harassment incidents and many people posted various hatred tweets. Using this…
0 runs0 likes0 downloads0 reach0 impact
807174 instances - 9 features - classes - 197782 missing values
The dataset represents 10 years (1999-2008) of clinical care at 130 US hospitals and integrated delivery networks. It includes over 50 features representing patient and hospital outcomes. Information…
0 runs0 likes0 downloads0 reach0 impact
101766 instances - 48 features - 3 classes - 192849 missing values
uci
0 runs0 likes0 downloads0 reach0 impact
101766 instances - 52 features - classes - 192849 missing values
Context The projects goal is to use this dataset to predict the level of success game developers should expect given their game design details. Features such as 'Indie' (developed by indie studio),…
0 runs0 likes0 downloads0 reach0 impact
30250 instances - 27 features - classes - 188331 missing values
Context This dataset has an exhaustive list of player statistics for each season from 2013 - 2020 Content Each row is associated with a player and a season. Eg. You will have 7 rows for Lionel Messi:…
0 runs0 likes0 downloads0 reach0 impact
5733 instances - 109 features - classes - 187897 missing values
Context TFT TFT is an 8-player free-for-all drafting tactics game in which the player recruits powerful champions, deploys them, and battles to become the last player standing. When acquired, a…
0 runs0 likes0 downloads0 reach0 impact
748928 instances - 14 features - classes - 187360 missing values