Data
No data.
30 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
31 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
30 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
34 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
28 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
Title: Thoracic Surgery Data Data Set. Abstract: The data is dedicated to a classification problem related to the post-operative life expectancy of lung cancer patients: class 1 - death within…
145 runs - 0 likes - 0 downloads - 0 reach - 0 impact
470 instances - 17 features - 2 classes - 0 missing values
Dataset: Reduced version (10% of the examples) of the bank-marketing dataset.
1259 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4521 instances - 17 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
676 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10992 instances - 17 features - 2 classes - 0 missing values
Abstract: The data set is composed of 60 chorales (5665 events) by J.S. Bach (1685-1750). Each event of each chorale is labelled using 1 among 101 chord labels and described through 14 features.…
31 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5665 instances - 17 features - 102 classes - 0 missing values
The data was collected retrospectively at Wroclaw Thoracic Surgery Centre for patients who underwent major lung resections for primary lung cancer in the years 2007 - 2011. The Centre is associated…
31 runs - 0 likes - 0 downloads - 0 reach - 0 impact
470 instances - 17 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16599 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13488 instances - 17 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13488 instances - 17 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13488 instances - 17 features - 2 classes - 0 missing values
The data is related to direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
45211 instances - 17 features - 0 classes - 0 missing values
The dataset contains information on 13,932 single-family homes sold in Miami in 2016. Besides publicly available information, the dataset creator Steven C. Bourassa has added distance variables,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13932 instances - 17 features - 0 classes - 0 missing values
Students' Academic Performance Dataset (xAPI-Edu-Data). Data Set Characteristics: Multivariate. Number of Instances: 480. Area: E-learning, Education, Predictive models, Educational Data Mining. Attribute…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
480 instances - 17 features - classes - 0 missing values
Context: The subject of this dataset is multi-instrument observations of solar flares. There are a number of space-based instruments that are able to observe solar flares on the Sun; some instruments…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
12455 instances - 17 features - classes - 0 missing values
The New York City Bike Share enables quick, easy, and affordable bike trips around the New York City boroughs. They make regular open data releases (this dataset is a transformed version of the data…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
735502 instances - 17 features - classes - 0 missing values
Context: I have gathered this dataset over the course of 8 years and put a lot of effort into it (see soccerverse.com). If you use the data for any kind of project, please drop me a line or ping me on…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1078214 instances - 17 features - classes - 4031 missing values
Context: Explore the regression algorithm using the prices of Lisbon's houses. This dataset contains a total of 246 records. Content: The attributes of this dataset are: Id: a unique identifying…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
246 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13488 instances - 17 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
13488 instances - 17 features - 2 classes - 0 missing values
Subsampling of the dataset house_16H (44123) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 17 features - 2 classes - 0 missing values
Subsampling of the dataset house_16H (44123) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 17 features - 2 classes - 0 missing values
Subsampling of the dataset house_16H (44123) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 17 features - 2 classes - 0 missing values
Subsampling of the dataset house_16H (44123) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 17 features - 2 classes - 0 missing values
Subsampling of the dataset house_16H (44123) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 17 features - 2 classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
520 instances - 17 features - classes - 0 missing values
33
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8783 instances - 17 features - classes - 4314 missing values
This data set contains demographic, geographic, and severity information for all confirmed and probable cases reported to and managed by Toronto Public Health since the first case was reported in…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14911 instances - 17 features - classes - 1212 missing values
Context: A compilation of all the development-related courses (10 thousand courses) available on Udemy's website. Under the development category, there are courses from Web Development, Data…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9932 instances - 17 features - classes - 336 missing values
Context: A compilation of all the development-related courses (10 thousand courses) available on Udemy's website. Under the development category, there are courses from Web Development, Data…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
9932 instances - 17 features - classes - 336 missing values
National Longitudinal Survey (Binarized). Survey results from the U.S. Bureau of Labor Statistics to gather information on the labour market activities and other life events of several groups. The…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4908 instances - 17 features - 2 classes - 0 missing values
The data is related to direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
45211 instances - 17 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16599 instances - 17 features - 0 classes - 0 missing values
Context: This is historical data on cryptocurrency trading for the period from 2016-01-01 to 2021-02-21. If you enjoy this dataset please upvote so I can see it is popular and I need to update it.…
0 runs - 1 likes - 0 downloads - 1 reach - 0 impact
2382643 instances - 17 features - classes - 4862194 missing values
This case requires developing a customer segmentation to define a marketing strategy. The sample dataset summarizes the usage behavior of about 9000 active credit card holders during the last 6 months.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
8950 instances - 17 features - classes - 314 missing values
Data Set Information: This has been collected using direct questionnaires from the patients of Sylhet Diabetes Hospital in Sylhet, Bangladesh and approved by a doctor.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
520 instances - 17 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on categorical and numerical features"…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
581835 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on categorical and numerical features"…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
581835 instances - 17 features - 0 classes - 0 missing values
Subsampling of the dataset segment (40984) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 17 features - 7 classes - 0 missing values
Subsampling of the dataset segment (40984) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 17 features - 7 classes - 0 missing values
Subsampling of the dataset segment (40984) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 17 features - 7 classes - 0 missing values
Subsampling of the dataset segment (40984) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 17 features - 7 classes - 0 missing values
Subsampling of the dataset segment (40984) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 17 features - 7 classes - 0 missing values
uci adult partitioned
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
48844 instances - 17 features - classes - 6495 missing values
Identify jets of particles from the LHC, created for the study of ultra low latency inference with hls4ml. Use 16 high-level features to identify the 5 jet classes: quark (q), gluon (g), W boson (w),…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
830000 instances - 17 features - 5 classes - 0 missing values
Context: Pantheon is a project celebrating the cultural information that endows our species with these fantastic capacities. To celebrate our global cultural heritage we are compiling, analyzing and…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
11341 instances - 17 features - classes - 11329 missing values
Acknowledgements: The data was scraped from Booking.com. All data in the file is publicly available to everyone already. Data is originally owned by Booking.com. Please contact me through my profile if…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
515738 instances - 17 features - classes - 6536 missing values
Context: Find the best strategies to improve the next marketing campaign. How can the financial institution achieve greater effectiveness in future marketing campaigns? In order to answer this, we…
0 runs - 1 likes - 1 downloads - 2 reach - 0 impact
11162 instances - 17 features - classes - 0 missing values
Context: The Consumer Price Indexes (CPI) program produces monthly data on changes in the prices paid by urban consumers for a representative basket of goods and services. It is a useful way to…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4349 instances - 17 features - classes - 218 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
22784 instances - 17 features - 0 classes - 0 missing values
Database of baseball players and play statistics, including 'Games_played', 'At_bats', 'Runs', 'Hits', 'Doubles', 'Triples', 'Home_runs', 'RBIs', 'Walks', 'Strikeouts', 'Batting_average',…
795 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1340 instances - 17 features - 3 classes - 20 missing values
This database was designed on the basis of data provided by the US Census Bureau [http://www.census.gov] (under Lookup Access [http://www.census.gov/cdrom/lookup]: Summary Tape File 1). The data were…
3 runs - 0 likes - 0 downloads - 0 reach - 0 impact
22784 instances - 17 features - 0 classes - 0 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
519 runs - 0 likes - 0 downloads - 0 reach - 0 impact
203 instances - 17 features - 11 classes - 0 missing values
We create a digit database by collecting 250 samples from 44 writers. The samples, written by 30 writers, are used for training, cross-validation and writer dependent testing, and the digits written…
37686 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10992 instances - 17 features - 10 classes - 0 missing values
See https://github.com/slds-lmu/paper_2023_ci_for_ge for a description.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5100000 instances - 17 features - 0 classes - 0 missing values
See https://github.com/slds-lmu/paper_2023_ci_for_ge for a description.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5100000 instances - 17 features - 0 classes - 0 missing values
See https://github.com/slds-lmu/paper_2023_ci_for_ge for a description.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5100000 instances - 17 features - 0 classes - 0 missing values
See https://github.com/slds-lmu/paper_2023_ci_for_ge for a description.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5100000 instances - 17 features - 2 classes - 0 missing values
A simple database containing 17 Boolean-valued attributes describing animals. The "type" attribute appears to be the class attribute. Notes: * I find it unusual that there are 2 instances of "frog"…
191 runs - 0 likes - 0 downloads - 0 reach - 0 impact
101 instances - 17 features - 7 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - classes - 0 missing values
This dataset contains all Premier League matches from 2008 to 2016, with player statistics taken from Sofifa.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2961 instances - 17 features - classes - 0 missing values
Context: Since its inception in 2008, Airbnb has disrupted the traditional hospitality industry as more travellers decide to use Airbnb as their primary means of accommodation. Airbnb offers…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
226030 instances - 17 features - classes - 213880 missing values
Content: All data was collected via my Garmin GPS watch and uploaded to Garmin Connect via Bluetooth. There are gaps in my log, the largest being from summer of 2019 to about spring 2020 (I ran…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
689 instances - 17 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16599 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16599 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
22784 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
22784 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16599 instances - 17 features - 0 classes - 0 missing values
Vulnerability Dataset For Binary Classification
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5692 instances - 17 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
22784 instances - 17 features - 0 classes - 0 missing values
#study_1
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
944 instances - 17 features - classes - 0 missing values
will be updated
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2111 instances - 17 features - classes - 0 missing values
No data.
71 runs - 0 likes - 5 downloads - 5 reach - 12 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
291 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 7 classes - 0 missing values
See https://github.com/slds-lmu/paper_2023_ci_for_ge for a description.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5100000 instances - 17 features - 0 classes - 76 missing values
No data.
356 runs - 0 likes - 0 downloads - 0 reach - 0 impact
131072 instances - 17 features - 2 classes - 0 missing values
No data.
67 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
293 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 10 classes - 0 missing values
No data.
332 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
311 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
The data is related to direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was…
65709 runs - 3 likes - 41 downloads - 44 reach - 33 impact
45211 instances - 17 features - 2 classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
189 instances - 17 features - classes - 0 missing values
Description: The Electric_Vehicle_Population_Data.csv dataset provides a comprehensive overview of the electric vehicle (EV) population within a specific region, highlighting key information about the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
181458 instances - 17 features - classes - 421 missing values
Description: The dataset named "ObesityDataSet_raw_and_data_sinthetic.csv" comprises comprehensive attributes aimed at the analysis and prediction of obesity levels among individuals. It captures a…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2111 instances - 17 features - classes - 0 missing values
No data.
50 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000000 instances - 18 features - 22 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Jura (Goovaerts 1997) dataset consists of measurements of concentrations of seven heavy…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
359 instances - 18 features - classes - 0 missing values
Testing this platform
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
36203 instances - 18 features - 0 classes - 8971 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Electrical Discharge Machining dataset (Karalic and Bratko 1997) represents a two-target…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
154 instances - 18 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Electrical Discharge Machining dataset (Karalic and Bratko 1997) represents a two-target…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
154 instances - 18 features - classes - 0 missing values
Context: Kerala is a state in South India known for its beautiful beaches and backwaters. Also known as 'God's own country', it was named one of the 10 paradises of the world by National Geographic. Due…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
117 instances - 18 features - classes - 0 missing values
The dataset consists of feature vectors belonging to 12,330 sessions. The dataset was formed so that each session would belong to a different user in a 1-year period to avoid any tendency to a specific…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
12330 instances - 18 features - 2 classes - 0 missing values
No data.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
16599 instances - 18 features - classes - 0 missing values
This data set includes hourly air pollutants data from 12 nationally-controlled air-quality monitoring sites.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
420768 instances - 18 features - classes - 74027 missing values
This data set measures the running time of a matrix-matrix product A x B = C, where all matrices have size 2048 x 2048, using a parameterizable SGEMM GPU kernel with 241600 possible parameter…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
241600 instances - 18 features - classes - 0 missing values
Context: As a fan of anime (Japanese animated media), I have always wanted to have information about all the anime I can get my hands on. Content: This data was scraped from anime-planet on June 15,…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
14578 instances - 18 features - classes - 28902 missing values
Data is pulled from SDSS Skyserver from data release 16 using the following query. SELECT p.objid,p.ra,p.dec,p.u,p.g,p.r,p.i,p.z, p.run, p.rerun, p.camcol, p.field, s.specobjid, s.class, s.z as…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
732977 instances - 18 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on categorical and numerical features"…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
21613 instances - 18 features - 0 classes - 0 missing values