Data
Context Manufacturing process feature selection and categorization Content Abstract: Data from a semi-conductor manufacturing process Data Set Characteristics: Multivariate Number of Instances: 1567…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1567 instances - 592 features - classes - 41951 missing values
Context This dataset was created to predict future New York housing prices based on the past 17 years of records. Content Please check the details under the column description.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1600202 instances - 10 features - classes - 40496 missing values
Title: Communities and Crime Abstract: Communities within the United States. The data combines socio-economic data from the 1990 US Census, law enforcement data from the 1990 US LEMAS survey, and…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1994 instances - 128 features - 0 classes - 39202 missing values
Ignores community name. **Author**: Title: Communities and Crime Abstract: Communities within the United States. The data combines socio-economic data from the 1990 US Census, law enforcement data from…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1994 instances - 127 features - 0 classes - 39202 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1994 instances - 127 features - 0 classes - 39202 missing values
Communities and Crime (Binarized). The following changes were introduced to OpenML Dataset 315: binarized 'racepctblack' at 0.06; binarized 'ViolentCrimesPerPop' at 0.2.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1994 instances - 129 features - 2 classes - 39202 missing values
Communities and Crime (Binarized). The following changes were introduced to OpenML Dataset 315: binarized 'racepctblack' at 0.06; binarized 'ViolentCrimesPerPop' at 0.2.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1994 instances - 129 features - 0 classes - 39202 missing values
## **Meta-Album Animals with Attributes Dataset (Extended)** The original Animals with Attributes 2 (AWA) dataset (https://cvml.ist.ac.at/AwA2/) was designed to benchmark transfer-learning algorithms,…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
37318 instances - 3 features - 50 classes - 37318 missing values
From original source: ----- Communities within the United States. The data combines socio-economic data from the 1990 US Census, law enforcement data from the 1990 US LEMAS survey, and crime data from…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1994 instances - 123 features - 0 classes - 36851 missing values
A modified version of the European Soccer Database by Hugo Mathien (https://www.kaggle.com/hugomathien). New players' performance indicators have been created from the original dataset. Data features:…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
25979 instances - 31 features - classes - 35756 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
754 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8844 instances - 57 features - 2 classes - 34843 missing values
Improve on the state of the art in credit scoring by predicting the probability that somebody will experience financial distress in the next two years. ## Description Banks play a crucial role in…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
150000 instances - 11 features - 2 classes - 33655 missing values
Context https://www.kaggle.com/c/birdsong-recognition Content This is information captured about bird recordings from France Acknowledgements We wouldn't be here without the help of many contributors…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
20851 instances - 26 features - classes - 32566 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
733 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7485 instances - 56 features - 2 classes - 32427 missing values
## **Meta-Album RESISC Dataset (Extended)** RESISC45 dataset (https://gcheng-nwpu.github.io/) gathers 700 RGB images of size 256x256 px for each of 45 scene categories. The data authors strive to…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
31500 instances - 3 features - 45 classes - 31500 missing values
21+ years of the euro 04 Jan 1999 - 12 Feb 2021 It wasn't until 1999 that the euro really began its journey, when 11 countries (Austria, Belgium, Finland, France, Germany, Ireland, Italy, Luxembourg,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
6007 instances - 41 features - classes - 31244 missing values
Content This is a dataset I started building for my future personal projects, as I think this kind of data is quite hard to acquire for free and in short time. I started acquiring data on March 21st,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
193279 instances - 4 features - classes - 29954 missing values
The goal of this challenge is to expose the research community to real world datasets of interest to 4Paradigm. All datasets are formatted in a uniform way, though the type of data might differ. The…
1 runs - 0 likes - 0 downloads - 0 reach - 15 impact
31406 instances - 23 features - 2 classes - 29756 missing values
Official World Golf Ranking Data Context: The Official World Golf Ranking is a system for rating the performance level of male professional golfers. It was started in 1986. [1] The rankings are based…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9000 instances - 12 features - classes - 29318 missing values
Context As a fan of anime (Japanese animated media), I have always wanted to have information about all the anime I can get my hands on. Content This data was scraped from anime-planet on June 15,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14578 instances - 18 features - classes - 28902 missing values
Context WallStreetBets (r/wallstreetbets, also known as WSB), is a subreddit where participants discuss stock and option trading. It has become notable for its profane nature and allegations of users…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
53187 instances - 7 features - classes - 28655 missing values
What is it? This dataset is a record of 3.5 million+ US domestic flights from 1990 to 2009. It was taken from the OpenFlights website, which has a huge database of different travelling mediums…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3606803 instances - 15 features - classes - 27522 missing values
source: http://www.cs.ubc.ca/labs/beta/Projects/SATzilla/ authors: L. Xu, F. Hutter, H. Hoos, K. Leyton-Brown translator in coseal format: M. Lindauer with the help of Alexandre Frechette the data do…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4440 instances - 117 features - 0 classes - 27150 missing values
Context https://www.kaggle.com/c/birdsong-recognition Content This is information captured about bird recordings from India Acknowledgements We wouldn't be here without the help of many contributors…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13654 instances - 26 features - classes - 26863 missing values
Context https://www.kaggle.com/c/birdsong-recognition Content This is information captured about bird recordings from India Acknowledgements We wouldn't be here without the help of many contributors…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13654 instances - 26 features - classes - 26863 missing values
Context Scraped from https://twitter.com/dril January 2020. Content Dataset contains date of tweet, text of tweet, and other statistics such as likes and retweets. Acknowledgements Many thanks to…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8921 instances - 9 features - classes - 26162 missing values
Context As a community of responsible people, can we focus on predicting these disasters to save lives? People are already going through worse; as if that was not enough, if a sudden earthquake comes…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13729 instances - 22 features - classes - 25869 missing values
## **Meta-Album Textures-ALOT Dataset (Extended)** Textures ALOT dataset (https://aloi.science.uva.nl/public_alot/) consists of 27 500 images from 250 categories. The images in the dataset are…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
25000 instances - 3 features - 250 classes - 25000 missing values
Has the curve flattened? Countries around the world are working to flatten the curve of the coronavirus pandemic. Flattening the curve involves reducing the number of new COVID-19 cases from one day…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16659 instances - 28 features - classes - 24650 missing values
Data taken from ourworldindata.org For more data go here: Total confirmed cases: https://covid.ourworldindata.org/data/ecdc/total_cases.csv Total deaths:…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
59354 instances - 10 features - classes - 24447 missing values
Context This is a subset from the Monitoring the Future Survey of high school students. My focus in working on this is trying to understand the effect that technology use has on grades (GPA). Content…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2202 instances - 24 features - classes - 24084 missing values
No description available
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
190776 instances - 20 features - 0 classes - 22484 missing values
The original Annealing dataset from UCI. The exact meaning of the features and classes is largely unknown. Annealing, in metallurgy and materials science, is a heat treatment that alters the physical…
13779 runs - 0 likes - 16 downloads - 16 reach - 15 impact
898 instances - 39 features - 5 classes - 22175 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
712 runs - 0 likes - 0 downloads - 0 reach - 0 impact
898 instances - 39 features - 2 classes - 22175 missing values
The original Annealing dataset from UCI. The exact meaning of the features and classes is largely unknown. Annealing, in metallurgy and materials science, is a heat treatment that alters the physical…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
898 instances - 39 features - 5 classes - 22175 missing values
## **Meta-Album Dogs Dataset (Extended)** Researchers from Stanford University created the original Dogs dataset (http://vision.stanford.edu/aditya86/ImageNetDogs/). It contains more than 20 000…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
20480 instances - 3 features - 120 classes - 20480 missing values
Subsampling of the dataset APSFailure (41138) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 101 features - 2 classes - 19825 missing values
No data.
283 runs - 0 likes - 0 downloads - 0 reach - 0 impact
96 instances - 4027 features - 11 classes - 19667 missing values
No data.
296 runs - 0 likes - 0 downloads - 0 reach - 0 impact
96 instances - 4027 features - 9 classes - 19667 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
96 instances - 4027 features - 9 classes - 19667 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
96 instances - 4027 features - 11 classes - 19667 missing values
The goal of the research is to help the auditors by building a classification model that can predict the fraudulent firm on the basis of the present and historical risk factors. The information about the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1552 instances - 37 features - 0 classes - 19402 missing values
Subsampling of the dataset APSFailure (41138) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 101 features - 2 classes - 19156 missing values
Context UFC 257: Poirier vs. McGregor 2 was a mixed martial arts event produced by the Ultimate Fighting Championship that took place on January 24, 2021 at the Etihad Arena on Yas Island, Abu Dhabi,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
40819 instances - 14 features - classes - 19084 missing values
## **Meta-Album OmniPrint-MD-mix Dataset (Micro)** OmniPrint-MD-mix dataset consists of 28 240 images (128x128, RGB) from 706 categories. The images are synthesized with OmniPrint, and no further…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
800 instances - 69 features - 0 classes - 18840 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8553 instances - 10 features - classes - 18454 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8378 instances - 123 features - classes - 18372 missing values
This data was gathered from participants in experimental speed dating events from 2002-2004. During the events, the attendees would have a four-minute "first date" with every other participant of the…
28211 runs - 19 likes - 170 downloads - 189 reach - 36 impact
8378 instances - 121 features - 2 classes - 18372 missing values
Author: Marius Lindauer Date: 27.02.2014 This data set was generated for a publication about claspfolio 2.0, i.e., an algorithm selector for ASP. The algorithm portfolio of clasp (2.1.4)…
0 runs - 0 likes - 0 downloads - 0 reach - 8 impact
1294 instances - 143 features - 11 classes - 18258 missing values
Subsampling of the dataset APSFailure (41138) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 101 features - 2 classes - 17956 missing values
Context Models of human cognition hold that information processing occurs in a series of stages. Cognitive psychology, in particular, is concerned with the internal mental processes that begin with…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
6854 instances - 23 features - classes - 16924 missing values
Acknowledgements This data was scraped from http://house.speakingsame.com/ and includes data from 322 Perth suburbs, resulting in an average of about 100 rows per suburb. Content I believe the columns…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
33656 instances - 18 features - classes - 16585 missing values
## **Meta-Album Cars Dataset (Extended)** The original Cars dataset (https://ai.stanford.edu/~jkrause/cars/car_dataset.html) was collected in 2013, and it contains more than 16 000 images from 196…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
16185 instances - 3 features - 196 classes - 16185 missing values
Subsampling of the dataset APSFailure (41138) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 101 features - 2 classes - 15654 missing values
Google Play Store Google Play, formerly Android Market, is a digital distribution service operated and developed by Google. It serves as the official app store for certified devices running on the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
12495 instances - 12 features - classes - 15602 missing values
Subsampling of the dataset APSFailure (41138) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 101 features - 2 classes - 15419 missing values
Context I am developing my data science skills in areas outside of my previous work. An interesting problem for me was to identify which factors influence life expectancy on a national level. There is…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3111 instances - 32 features - classes - 15246 missing values
Context The Philippine Statistics Authority (PSA) spearheads the conduct of the Family Income and Expenditure Survey (FIES) nationwide. The survey, which is undertaken every three (3) years, is aimed…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
41544 instances - 60 features - classes - 15072 missing values
## **Meta-Album Subcellular Human Protein Dataset (Extended)** This dataset is a subset of the Subcellular dataset in the Protein Atlas project (https://www.proteinatlas.org/). The original dataset,…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
15050 instances - 3 features - 21 classes - 15050 missing values
Context Since 2008, guests and hosts have used Airbnb to travel in a more unique, personalized way. As part of the Airbnb Inside initiative, this dataset describes the listing activity of homestays in…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3845 instances - 51 features - classes - 14267 missing values
Context There are some great UFC datasets out there, but I could not find one that included gambling odds. So I went and made one myself. This dataset focuses very generally on the fights and hopes to…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5528 instances - 11 features - classes - 14168 missing values
About Dataset This data set includes 15K FIFA 20 players with 15+ features and their images, including their position, age, and country, and many more. It can be used for learning Statistics,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14999 instances - 74 features - classes - 14009 missing values
Context Since Amazon Echo Dot 2 has been the best selling Alexa product, we decided to extract the reviews posted on Amazon for this device. This particular dataset contains reviews posted in…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
6855 instances - 10 features - 0 classes - 13903 missing values
This dataset contains moonrise, moonset and lunar phase timings for every date from 2005 to 2017 for London, UK*; collected from timeanddate. Inspiration This data can be used to study the effects of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4748 instances - 8 features - classes - 13280 missing values
Context Melbourne real estate is BOOMING. Can you find the insight or predict the next big trend to become a real estate mogul or even harder, to snap up a reasonably priced 2-bedroom unit? Content…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13580 instances - 21 features - classes - 13256 missing values
### Description: The dataset, named `melb_data.csv`, represents detailed information about real estate sales in various suburbs across Melbourne. It captures specific characteristics of residential…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13580 instances - 21 features - classes - 13256 missing values
Subsampling of the dataset albert (41147) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 79 features - 2 classes - 13059 missing values
Context For the past decade, Airbnb has emerged as a great personalized staying option for customers worldwide. This dataset gives the details of Airbnb listings in Buenos Aires as of 24th November 2019…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
22877 instances - 15 features - classes - 12925 missing values
Subsampling of the dataset albert (41147) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 79 features - 2 classes - 12888 missing values
Subsampling of the dataset albert (41147) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 79 features - 2 classes - 12826 missing values
Subsampling of the dataset albert (41147) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 79 features - 2 classes - 12697 missing values
Subsampling of the dataset albert (41147) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 79 features - 2 classes - 12678 missing values
This dataset is from the "Explainable Machine Learning Challenge": > The Explainable Machine Learning Challenge is a collaboration between Google, FICO and academics at Berkeley, Oxford, Imperial, UC…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9871 instances - 24 features - 2 classes - 12643 missing values
## **Meta-Album Birds Dataset (Mini)** When Meta-Album was created, the Birds dataset (https://www.kaggle.com/datasets/gpiosenka/100-bird-species) contained images of 315 bird species, but now it has…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
12600 instances - 3 features - 315 classes - 12600 missing values
Context This dataset was created when I practiced webscraping. Content The data is a compilation of information on dogs who were available for adoption on December 12, 2019 in the Hungarian Database…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2937 instances - 19 features - classes - 12405 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
66 instances - 4027 features - 3 classes - 12269 missing values
A certain premium club boasts a large customer membership. The members pay an annual membership fee in return for using the exclusive facilities offered by this club. The fees are customized for every…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10362 instances - 15 features - 2 classes - 12224 missing values
A certain premium club boasts a large customer membership. The members pay an annual membership fee in return for using the exclusive facilities offered by this club. The fees are customized for every…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10362 instances - 15 features - 2 classes - 12224 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10173 instances - 65 features - classes - 12157 missing values
COVID19-Algeria-and-World-Dataset A coronavirus dataset with 104 countries constructed from different reliable sources, where each row represents a country, and the columns represent geographic,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
38472 instances - 15 features - classes - 11759 missing values
Context Pantheon is a project celebrating the cultural information that endows our species with these fantastic capacities. To celebrate our global cultural heritage we are compiling, analyzing and…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
11341 instances - 17 features - classes - 11329 missing values
Source: Charles Gaydon. This data contains only 5 variables: Productcode, Warehouse, ProductCategory, Date, Order_demand. I showed that it is possible, with trivial models, to lower the mean average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1048575 instances - 5 features - classes - 11239 missing values
For a more comprehensive dataset with many more features check out the "Yacht/Motorboat Pricing Data (10,000+ listings)" dataset. Link below:…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2530 instances - 18 features - classes - 11212 missing values
Annual salary information including gross pay and overtime pay for all active, permanent employees of Montgomery County, MD paid in calendar year 2016. This information will be published annually each…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9228 instances - 13 features - 0 classes - 11169 missing values
About our Dataset The journey of collecting this Covid-19 India dataset began with a competition where we had to do sentiment analysis of tweets. The data was collected from…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
648958 instances - 4 features - classes - 10980 missing values
### Description ### This dataset is part of a collection of datasets based on the game "Jungle Chess" (a.k.a. Dou Shou Qi). For a description of the rules, please refer to the paper (link attached). The…
11 runs - 0 likes - 0 downloads - 0 reach - 0 impact
44819 instances - 47 features - 3 classes - 10584 missing values
Context The Social Dilemma, a documentary-drama hybrid, explores the dangerous human impact of social networking, with tech experts sounding the alarm on their own creations as the tech experts sound…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
20068 instances - 14 features - classes - 10429 missing values
## **Meta-Album Sports Actions Dataset (Extended)** The 100-Sports dataset (https://www.kaggle.com/datasets/gpiosenka/sports-classification) is a collection of sports images covering 73 different…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
10416 instances - 3 features - 73 classes - 10416 missing values
Context I am a really huge football fan and the Premier League is one of my favourite football (or soccer, whatever you like to call it) leagues. So, as my very first dataset, I thought this would be…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
571 instances - 59 features - classes - 10224 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : This is a pre-processed version of the dataset used in Kaggle's Online Product Sales competition…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
639 instances - 413 features - classes - 10012 missing values
No data.
68 runs - 0 likes - 0 downloads - 0 reach - 0 impact
20000 instances - 17 features - 3 classes - 10000 missing values
## **Meta-Album Textures-ALOT Dataset (Mini)** Textures ALOT dataset (https://aloi.science.uva.nl/public_alot/) consists of 27 500 images from 250 categories. The images in the dataset are captured in…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
10000 instances - 3 features - 250 classes - 10000 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10503 instances - 65 features - classes - 9888 missing values
Context Throughout the world of data science, there are many languages and tools that can be used to complete a given task. While you are often able to use whichever tool you prefer, it is often…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10153 instances - 4 features - classes - 9824 missing values
## **Meta-Album Airplanes Dataset (Extended)** The original Airplanes dataset (https://zenodo.org/record/3464319) comprises more than 9 000 remote sensing images acquired from Google Earth satellite…
0 runs - 0 likes - 0 downloads - 0 reach - 1 impact
9625 instances - 3 features - 21 classes - 9625 missing values
Context This dataset was generated after an exploratory analysis of data provided by Carbon Disclosure Project in the following competition: CDP: Unlocking Climate Solutions. It contains a compilation…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
615 instances - 58 features - classes - 9612 missing values
Content The data contains Best Sellers List published by The New York Times every Sunday. The temporal range is from 03-Jan-2010 to 29-Dec-2019 which makes it a whole decade of data. Each week, 5…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
61430 instances - 12 features - classes - 9476 missing values
Subsampling of the dataset KDDCup09-Upselling (43072) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 101 features - 2 classes - 9426 missing values
Daily pickup data for 329 FHV companies from January 2015 through August 2015. From original source: ----- There is also a file other-FHV-data-jan-aug-2015.csv containing daily pickup data for 329 FHV…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
29826 instances - 5 features - classes - 9276 missing values
Content This is a huge dataset that contains every web series around the globe streaming right now at the date of the creation of the dataset. Inspiration This dataset can be used to answer the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
12353 instances - 9 features - classes - 9257 missing values