Data
Filter results by:
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
965 runs0 likes0 downloads0 reach0 impact
137 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
739 runs0 likes0 downloads0 reach0 impact
4052 instances - 8 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
782 runs0 likes0 downloads0 reach0 impact
70 instances - 8 features - 2 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs0 likes0 downloads0 reach0 impact
48 instances - 8 features - 0 classes - 0 missing values
* Title: seismic-bumps Data Set * Abstract: The data describe the problem of high energy (higher than 10^4 J) seismic bumps forecasting in a coal mine. Data come from two of longwalls located in a…
152 runs0 likes0 downloads0 reach0 impact
210 instances - 8 features - 3 classes - 0 missing values
* Title: seeds Data Set * Abstract: Measurements of geometrical properties of kernels belonging to three different varieties of wheat. A soft X-ray technique and GRAINS package were used to construct…
190 runs0 likes0 downloads0 reach0 impact
210 instances - 8 features - 3 classes - 0 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by Allison, T. and Cicchetti, D.…
0 runs0 likes0 downloads0 reach0 impact
62 instances - 8 features - 0 classes - 12 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Identifier attribute deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
2 runs0 likes0 downloads0 reach0 impact
398 instances - 8 features - 0 classes - 6 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
104 runs0 likes0 downloads0 reach0 impact
379 instances - 8 features - 2 classes - 1368 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
815 runs0 likes0 downloads0 reach0 impact
336 instances - 8 features - 2 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
209 instances - 8 features - classes - 0 missing values
Dataset Title: Localization Data for Person Activity Data Set Abstract: Data contains recordings of five people performing different activities. Each person wore four sensors (tags) while performing…
6 runs0 likes0 downloads0 reach0 impact
164860 instances - 8 features - 11 classes - 0 missing values
Sensor data measurements of one Boiler, containing WaterInput/SteamOutput (flow, temperature, pressure) for one month, which is measured every minute.
0 runs0 likes0 downloads0 reach0 impact
44643 instances - 8 features - classes - 44643 missing values
The goal is to predict the Fare. Variable description: pclass: A proxy for socio-economic status (SES) 1st = Upper 2nd = Middle 3rd = Lower age: Age is fractional if less than 1. If the age is…
0 runs0 likes4 downloads4 reach11 impact
1307 instances - 8 features - 0 classes - 0 missing values
ecoli-pmlb
31 runs0 likes0 downloads0 reach0 impact
327 instances - 8 features - 5 classes - 0 missing values
led7-pmlb
31 runs0 likes0 downloads0 reach0 impact
3200 instances - 8 features - 10 classes - 0 missing values
"The debutanizer column is part of a desulfuring and naphtha splitter plant." u1 Top temperature u2 Top pressure u3 Reflux flow u4 Flow to next process u5 6th tray temperature u6 Bottom…
0 runs0 likes0 downloads0 reach0 impact
2394 instances - 8 features - 0 classes - 0 missing values
This simple domain contains 7 Boolean attributes and 10 classes, the set of decimal digits. Recall that LED displays contain 7 light-emitting diodes -- hence the reason for 7 attributes. The class…
13156 runs0 likes10 downloads10 reach19 impact
500 instances - 8 features - 10 classes - 0 missing values
Data
0 runs0 likes0 downloads0 reach0 impact
539383 instances - 8 features - 2 classes - 0 missing values
Context As a russian league of legends player, i love LCL so much and i wanted to make a dataset linked with data from this league. Content Match data which documents all the regular season games of…
0 runs0 likes0 downloads0 reach0 impact
114 instances - 8 features - classes - 57 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
10578 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
38474 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
38474 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
10578 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
38474 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
10578 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
38474 instances - 8 features - 2 classes - 0 missing values
Airlines Dataset Inspired in the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure.
0 runs0 likes0 downloads0 reach0 impact
539383 instances - 8 features - 0 classes - 0 missing values
exercises
0 runs0 likes0 downloads0 reach0 impact
15000 instances - 8 features - classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 8 features - classes - 0 missing values
This collection includes data sets of one-dimensional ultrasound raw RF data (A-Scans) acquired from the biceps brachii muscles of 21 healthy volunteers. The annotation was performed by labeling the…
0 runs0 likes0 downloads0 reach0 impact
347 instances - 8 features - classes - 0 missing values
This collection includes data sets of one-dimensional ultrasound raw RF data (A-Scans) acquired from the biceps brachii muscles of a single healthy volunteer. The annotation was performed by labeling…
0 runs0 likes0 downloads0 reach0 impact
318 instances - 8 features - classes - 0 missing values
This is a dataset about the actors and actresses that have voice characters in Disney movies
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 8 features - 2 classes - 0 missing values
This is a dataset about the academic performance of students in different subjects
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 8 features - 2 classes - 0 missing values
Context While I was practicing machine learning, I wanted to create a simple dataset that is closely aligned to the real world scenario and gives better results to whet my appetite on this domain. If…
0 runs0 likes0 downloads0 reach0 impact
5001 instances - 8 features - classes - 0 missing values
About the dataset In this dataset you can find prices data from the biggest brazillian stock index(IBOV) from 1992 to 2019 and also day of the week and month informations: date: date in format…
0 runs0 likes0 downloads0 reach0 impact
6887 instances - 8 features - classes - 0 missing values
Context Thousands of cryptocurrencies have sprung up in the past few years. Can you predict which one will be the next BTC? Content The dataset contains daily opening, high, low, close, and trading…
0 runs0 likes0 downloads0 reach0 impact
567769 instances - 8 features - classes - 0 missing values
jobScheduling: HPC Job Scheduling Data In AppliedPredictiveModeling: Functions and Data Sets for 'Applied Predictive Modeling'
0 runs0 likes0 downloads0 reach0 impact
4331 instances - 8 features - 4 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
38474 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
38474 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
38474 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
38474 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
10578 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
10578 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
10578 instances - 8 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
38474 instances - 8 features - 2 classes - 0 missing values
Subsampling of the dataset bank-marketing (44126) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 8 features - 2 classes - 0 missing values
Subsampling of the dataset electricity (44120) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 8 features - 2 classes - 0 missing values
Subsampling of the dataset electricity (44120) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 8 features - 2 classes - 0 missing values
This data represents crime reported to the Seattle Police Department (SPD). Each row contains the record of a unique event where at least one criminal offense was reported by a member of the community…
0 runs0 likes0 downloads0 reach0 impact
523590 instances - 8 features - 144 classes - 6916 missing values
exercises
0 runs0 likes0 downloads0 reach0 impact
15000 instances - 8 features - classes - 0 missing values
1
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 8 features - classes - 0 missing values
19Realestatevaluation
0 runs0 likes0 downloads0 reach0 impact
414 instances - 8 features - classes - 0 missing values
#StudentPerfromance
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 8 features - 2 classes - 0 missing values
This is a historical data of HangSeng Futures Index based in Hong Kong. For non traders, the data is a time-series (sequential flow of numbers) describing the HangSeng Futures Index of HongKong. Every…
0 runs0 likes0 downloads0 reach0 impact
87645 instances - 8 features - classes - 0 missing values
Context This Dataset, consist of Tweets regarding Machine Learning, posted by Prof.Andrew Ng, Co-founder of Coursera, and an Adjunct Professor of Computer Science at Stanford University. His machine…
0 runs0 likes0 downloads0 reach0 impact
1234 instances - 8 features - classes - 1 missing values
Context Inspired by the New York City Taxi Trip Duration playground I created a dataset using the publicly available data from this link). Citi Bike is a bike sharing service available in New York…
0 runs0 likes0 downloads0 reach0 impact
4500000 instances - 8 features - classes - 0 missing values
Context This Data set Contains the values of Stock of Amazon which dates between 15 July ,2019 to 15 July,2020 Content The Data set contains 7 different columns that includes Date, The opening value…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 8 features - classes - 23 missing values
Context This dataset was created by our in house teams at PromptCloud(https://www.promptcloud.com/) and DataStock(https://datastock.shop/). We have about 5K samples in this dataset. You can download…
0 runs0 likes0 downloads0 reach0 impact
741 instances - 8 features - classes - 1575 missing values
Context This dataset was created by our in house teams at PromptCloud(https://www.promptcloud.com/) and DataStock(https://datastock.shop/). We have about 5K samples in this dataset. You can download…
0 runs0 likes0 downloads0 reach0 impact
741 instances - 8 features - classes - 1575 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
4177 instances - 8 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on both numerical and categorical…
0 runs0 likes0 downloads0 reach0 impact
24780 instances - 8 features - 2 classes - 0 missing values
Test
0 runs0 likes0 downloads0 reach0 impact
6330 instances - 8 features - classes - 0 missing values
r fgtgt
0 runs0 likes0 downloads0 reach0 impact
2 instances - 8 features - classes - 0 missing values
titanic surviual prediction
0 runs0 likes0 downloads0 reach0 impact
891 instances - 8 features - 0 classes - 0 missing values
titanic surviual prediction
0 runs0 likes0 downloads0 reach0 impact
891 instances - 8 features - classes - 0 missing values
titanic surviual prediction
0 runs0 likes0 downloads0 reach0 impact
891 instances - 8 features - classes - 0 missing values
titanic surviual prediction
0 runs0 likes0 downloads0 reach0 impact
891 instances - 8 features - classes - 0 missing values
this is titanic survival prediction
0 runs0 likes4 downloads4 reach7 impact
891 instances - 8 features - 0 classes - 0 missing values
titanic surviual prediction
0 runs0 likes3 downloads3 reach7 impact
891 instances - 8 features - 0 classes - 0 missing values
this is titanic survival prediction
0 runs0 likes5 downloads5 reach7 impact
891 instances - 8 features - 0 classes - 0 missing values
titanic surviual prediction
6 runs0 likes0 downloads0 reach0 impact
891 instances - 8 features - classes - 0 missing values
titanic surviual prediction
0 runs0 likes0 downloads0 reach0 impact
891 instances - 8 features - 0 classes - 0 missing values
titanic surviual prediction
0 runs0 likes0 downloads0 reach0 impact
891 instances - 8 features - 0 classes - 0 missing values
service data
0 runs0 likes0 downloads0 reach0 impact
34 instances - 8 features - classes - 0 missing values
tesl dataset about L
0 runs0 likes0 downloads0 reach0 impact
150000 instances - 8 features - classes - 0 missing values
Content The SP BSE SENSEX (SP Bombay Stock Exchange Sensitive Index), also called the BSE 30 or simply the SENSEX, is a free-float market-weighted stock market index of 30 well-established and…
0 runs0 likes0 downloads0 reach0 impact
73316 instances - 8 features - classes - 1500 missing values
Context This Data set Contains the values of Stock of Amazon which dates between 1st October,2019 to 14 July,2020 Content The Data set contains 7 different columns that includes Date, The opening…
0 runs0 likes0 downloads0 reach0 impact
202 instances - 8 features - classes - 13 missing values
Context It's always interesting to analyse what's going on in the news but there's no good data avaialable in indian context on kaggle, so I created one. Content Apart from the main content of the…
0 runs0 likes0 downloads0 reach0 impact
1568 instances - 8 features - classes - 33 missing values
Context Emotions Detection Is an Interesting Blend of Psychology and Technology. -This Technology Helps to Build a Companion Robots,This Robots Can be Friendly and Have The Ability to Recognize Users…
0 runs0 likes0 downloads0 reach0 impact
1104 instances - 8 features - classes - 0 missing values
Context 202.645 tweets collected with Twitter API by using the following hashtags and keywords, distancelearning, onlineschool, onlineteaching, virtuallearning, onlineducation, distanceeducation,…
0 runs0 likes0 downloads0 reach0 impact
202645 instances - 8 features - classes - 48656 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on categorical and numerical features"…
0 runs0 likes0 downloads0 reach0 impact
4052 instances - 8 features - 0 classes - 0 missing values
This data represents crime reported to the Seattle Police Department (SPD). Each row contains the record of a unique event where at least one criminal offense was reported by a member of the community…
0 runs0 likes0 downloads0 reach0 impact
52358 instances - 8 features - 0 classes - 650 missing values
Airlines Dataset Inspired in the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure. For this…
0 runs0 likes0 downloads0 reach0 impact
26969 instances - 8 features - 2 classes - 0 missing values
test3
0 runs0 likes0 downloads0 reach0 impact
2 instances - 8 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
336 instances - 8 features - classes - 0 missing values
Context Recent growing interest in cryptocurrencies, specifically as a speculative investment vehicle, has sparked global conversation over the past 12 months. Although this data is available across…
0 runs0 likes0 downloads0 reach0 impact
28944 instances - 8 features - classes - 0 missing values
Context Towards Data Science Inc. is a corporation registered in Canada as a platform for thousands of people to exchange ideas and to expand understanding of data science. This dataset contains data…
0 runs0 likes0 downloads0 reach0 impact
46079 instances - 8 features - classes - 3 missing values
Context As Al Hilal football club is one of the most successful football clubs in Saudi Arabia, Middle East and Asia this dataset was an excercise for Web Scraping where i collected the archive of the…
0 runs0 likes0 downloads0 reach0 impact
1363 instances - 8 features - classes - 219 missing values
This dataset has been scrapped off sa.aqar.fm to obtain land information such as price, size, street width, and locations. The uncleaned dataset scrapped 4347 rows, but seems like 1395 were duplicated…
0 runs0 likes0 downloads0 reach0 impact
2951 instances - 8 features - classes - 8085 missing values
Context Finding similarities between different types of public figures based in their Twitter activity. . Content usuario - username op = Openness to experience co =Conscientiousness ex = Extraversion…
0 runs0 likes0 downloads0 reach0 impact
140 instances - 8 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on categorical and numerical features"…
0 runs0 likes0 downloads0 reach0 impact
4052 instances - 8 features - 0 classes - 0 missing values
Subsampling of the dataset bank-marketing (44126) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 8 features - 2 classes - 0 missing values
Subsampling of the dataset bank-marketing (44126) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 8 features - 2 classes - 0 missing values
Subsampling of the dataset bank-marketing (44126) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 8 features - 2 classes - 0 missing values
Subsampling of the dataset bank-marketing (44126) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 8 features - 2 classes - 0 missing values
Subsampling of the dataset airlines (1169) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 8 features - 2 classes - 0 missing values
Subsampling of the dataset airlines (1169) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 8 features - 2 classes - 0 missing values
Subsampling of the dataset airlines (1169) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 8 features - 2 classes - 0 missing values
Subsampling of the dataset airlines (1169) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 8 features - 2 classes - 0 missing values