Data
Historical Rainfall data of Bangladesh
0 runs0 likes0 downloads0 reach0 impact
16755 instances - 4 features - 0 classes - 0 missing values
__Major changes w.r.t. version 2: ignored variable 3 in this upload as this seems to be a perfect predictor.__ Tamilnadu Electricity Board Hourly Readings dataset. Real-time readings were collected…
0 runs0 likes2 downloads2 reach19 impact
45781 instances - 4 features - 20 classes - 0 missing values
This file holds global land temperatures by country
0 runs0 likes0 downloads0 reach0 impact
577462 instances - 4 features - classes - 64563 missing values
holds information on average temperature per country
0 runs0 likes0 downloads0 reach0 impact
577462 instances - 4 features - classes - 64563 missing values
asdasd
0 runs0 likes0 downloads0 reach0 impact
140 instances - 4 features - classes - 0 missing values
mini insect example dataset # 1
0 runs0 likes0 downloads0 reach0 impact
12 instances - 4 features - 4 classes - 0 missing values
Context We publish the data to clarify the real evolution of forest area in post-communist Romania. The data is from the National Statistics Institute of Romania, so these are the official reported…
0 runs0 likes0 downloads0 reach0 impact
8111 instances - 4 features - classes - 0 missing values
good
0 runs0 likes0 downloads0 reach0 impact
10 instances - 4 features - classes - 2 missing values
Content This is a dataset I started building for my future personal projects, as I think this kind of data is quite hard to acquire for free and in short time. I started acquiring data on March 21st,…
0 runs0 likes0 downloads0 reach0 impact
193279 instances - 4 features - classes - 29954 missing values
This is a blended dataset that contains historic Swedish interest rates from 1908-2001 (Source/Källa: Sveriges riksbank) and the Swedish inflation rate 1908-2001 fetched from Sweden's statistic central bureau…
0 runs0 likes0 downloads0 reach0 impact
109 instances - 4 features - classes - 49 missing values
Context Just made a scraper for stackoverflow, and created a dataset. Hope it will be useful for your task Content Contains 1 csv file, containing following columns question_vote_count : Number of…
0 runs0 likes0 downloads0 reach0 impact
1544049 instances - 4 features - classes - 0 missing values
This collection includes 21 data sets of one-dimensional ultrasound raw RF data (A-Scans) acquired from the calf muscles of 8 healthy volunteers. The subjects were asked to manually annotate the data…
0 runs0 likes0 downloads0 reach0 impact
212872 instances - 4 features - classes - 0 missing values
This dataset attributes first names to genders, giving counts and probabilities. It combines open-source government data from the US, UK, Canada, and Australia. This dataset combines raw counts for…
0 runs0 likes0 downloads0 reach0 impact
147269 instances - 4 features - classes - 0 missing values
Overview This dataset contains 3 million Sudoku puzzles and their solutions. The level of difficulty varies -- some can be solved easily by a beginner, while others will challenge experienced solvers.…
0 runs0 likes0 downloads0 reach0 impact
3000000 instances - 4 features - 0 classes - 0 missing values
Context Fake news has become one of the biggest problems of our age. It has serious impact on our online as well as offline discourse. One can even go as far as saying that, to date, fake news poses a…
0 runs0 likes0 downloads0 reach0 impact
6335 instances - 4 features - classes - 0 missing values
Context The dataset contains reviews of Snapchat from the Google Play Store. With sentiment analysis, we can check users' adoption of the Android version of Snapchat, which has been improved…
0 runs0 likes0 downloads0 reach0 impact
32875 instances - 4 features - classes - 0 missing values
A fake movie dataset.
0 runs0 likes0 downloads0 reach0 impact
14 instances - 4 features - 1 classes - 0 missing values
### This is a dataset with dummy description
0 runs0 likes0 downloads0 reach0 impact
12 instances - 4 features - 4 classes - 0 missing values
### Description mini_insect_1
0 runs0 likes0 downloads0 reach0 impact
12 instances - 4 features - 4 classes - 0 missing values
We introduce AfriSenti, which consists of 14 sentiment datasets of 110,000+ tweets in 14 African languages (Amharic, Algerian Arabic, Hausa, Igbo, Kinyarwanda, Moroccan Arabic, Mozambican Portuguese,…
0 runs0 likes0 downloads0 reach0 impact
111720 instances - 4 features - 3 classes - 0 missing values
This dataset is an artificial simulation of the Duffing system with one phase transition to the chaotic regime.
0 runs0 likes0 downloads0 reach0 impact
9983 instances - 4 features - classes - 0 missing values
Last FM Dataset. Version: 1.0 (May 2011). Description: This dataset contains social networking, tagging, and music artist listening information from a set…
0 runs0 likes0 downloads0 reach0 impact
289955 instances - 4 features - classes - 0 missing values
e eded
0 runs0 likes0 downloads0 reach0 impact
2 instances - 4 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
2178 instances - 4 features - classes - 0 missing values
leak detection file
0 runs0 likes0 downloads0 reach0 impact
23 instances - 4 features - classes - 0 missing values
mini insect example dataset # 1
0 runs0 likes0 downloads0 reach0 impact
12 instances - 4 features - 4 classes - 0 missing values
mini insect example dataset # 1
0 runs0 likes0 downloads0 reach0 impact
12 instances - 4 features - 4 classes - 0 missing values
Context A dataset I used to classify tweets about my company. I took tweets and classified them manually as positive, negative or neutral. Content There are 4 columns: Id: the tweet id, unique.…
0 runs0 likes0 downloads0 reach0 impact
1097 instances - 4 features - classes - 0 missing values
Content This data is an extract from a bigger reddit dataset (All reddit comments from May 2019, 157 GB of data uncompressed) that contains both more comments and more associated information…
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 4 features - classes - 1 missing values
Data Description The dataset is from one of the leading travel sites and contains hotel reviews provided by customers. Variable Description User_ID unique ID of the customer Description description of the…
0 runs0 likes0 downloads0 reach0 impact
38932 instances - 4 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
163065 instances - 4 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1190 runs0 likes0 downloads0 reach0 impact
111 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1202 runs0 likes0 downloads0 reach0 impact
100 instances - 4 features - 2 classes - 0 missing values
1. Title: Haberman's Survival Data 2. Sources: (a) Donor: Tjen-Sien Lim (limt@stat.wisc.edu) (b) Date: March 4, 1999 3. Past Usage: 1. Haberman, S. J. (1976). Generalized Residuals for Log-Linear…
3243 runs0 likes0 downloads0 reach0 impact
306 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
802 runs0 likes0 downloads0 reach0 impact
662 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1073 runs0 likes0 downloads0 reach0 impact
140 instances - 4 features - 2 classes - 0 missing values
This dataset contains a simulation of the Lorenz attractor with the parameter $\rho$ varying in time. The stable and chaotic regimes alternate.
0 runs0 likes0 downloads0 reach0 impact
4942 instances - 4 features - classes - 0 missing values
A subset of the 3D dataset from Princeton's COS 429 Computer Vision course. The dataset consists of 40 models organised into 4 classes of 10 objects each.
0 runs0 likes0 downloads0 reach0 impact
16000 instances - 4 features - classes - 0 missing values
whitewine
0 runs0 likes0 downloads0 reach0 impact
78 instances - 4 features - classes - 0 missing values
redwine dataset
0 runs0 likes0 downloads0 reach0 impact
571 instances - 4 features - classes - 0 missing values
redwine data
0 runs0 likes0 downloads0 reach0 impact
571 instances - 4 features - classes - 0 missing values
redwine data
0 runs0 likes0 downloads0 reach0 impact
571 instances - 4 features - classes - 0 missing values
red
0 runs0 likes0 downloads0 reach0 impact
571 instances - 4 features - classes - 0 missing values
Context It's the preprocessed train data from Quora Insincere Questions competition 2018 The original train data is preprocessed to remove stop words, numbers, punctuations, common words and converted…
0 runs0 likes0 downloads0 reach0 impact
1306122 instances - 4 features - classes - 1 missing values
Context Amazon.com is one of the largest electronic commerce and cloud computing companies. Just a few Amazon related facts They lost 4.8 million in August 2013, when their website went down for 40…
0 runs0 likes0 downloads0 reach0 impact
2023070 instances - 4 features - classes - 0 missing values
Context Throughout the world of data science, there are many languages and tools that can be used to complete a given task. While you are often able to use whichever tool you prefer, it is often…
0 runs0 likes0 downloads0 reach0 impact
10153 instances - 4 features - classes - 9824 missing values
The number of children, youth and adults not attending schools or universities because of COVID-19 is soaring. Governments all around the world have closed educational institutions in an attempt to…
0 runs0 likes0 downloads0 reach0 impact
41714 instances - 4 features - classes - 0 missing values
About our Dataset The journey of collecting this Covid-19 India dataset began with a competition where we had to do sentiment analysis of tweets. The data was collected from…
0 runs0 likes0 downloads0 reach0 impact
648958 instances - 4 features - classes - 10980 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
163065 instances - 4 features - 0 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs0 likes0 downloads0 reach0 impact
163065 instances - 4 features - 0 classes - 0 missing values
PMLB version of the Titanic dataset, which only uses 3 features. See version 1 for the complete version: https://www.openml.org/d/40945
35 runs0 likes2 downloads2 reach23 impact
2201 instances - 4 features - 2 classes - 0 missing values
analcatdata_happiness-pmlb
31 runs0 likes0 downloads0 reach0 impact
60 instances - 4 features - 3 classes - 0 missing values
Test dataset to see upload.
0 runs0 likes0 downloads0 reach0 impact
73503 instances - 4 features - 2 classes - 0 missing values
Predicting forest cover ...
0 runs0 likes0 downloads0 reach0 impact
73503 instances - 4 features - 2 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
73503 instances - 4 features - classes - 0 missing values
fake dataset without any value
0 runs0 likes0 downloads0 reach0 impact
73503 instances - 4 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
35717 instances - 4 features - classes - 0 missing values
No data.
328 runs0 likes0 downloads0 reach0 impact
1000000 instances - 4 features - 2 classes - 0 missing values
No data.
330 runs0 likes0 downloads0 reach0 impact
1000000 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1268 runs0 likes11 downloads11 reach14 impact
131 instances - 4 features - 2 classes - 0 missing values
adas
0 runs0 likes0 downloads0 reach0 impact
902 instances - 4 features - 2 classes - 0 missing values
Description: The dataset contains information about various products, their stock levels, prices, and the locations where they are sold. Columns description: 1. Product: Represents the name of the…
0 runs0 likes0 downloads0 reach0 impact
73503 instances - 4 features - classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
578 instances - 4 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on both numerical and categorical…
1 runs0 likes0 downloads0 reach0 impact
163065 instances - 4 features - 0 classes - 0 missing values
Information about the dataset CLASSTYPE: numeric CLASSINDEX: last
2 runs0 likes0 downloads0 reach0 impact
559 instances - 5 features - 0 classes - 0 missing values
Information about the dataset CLASSTYPE: numeric CLASSINDEX: last
2 runs0 likes0 downloads0 reach0 impact
559 instances - 5 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach0 impact
366 instances - 5 features - classes - 2 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
35 runs0 likes0 downloads0 reach0 impact
23 instances - 5 features - 3 classes - 0 missing values
One of the datasets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff. It contains data on the DMFT Index (Decayed, Missing, and Filled Teeth) before and after different prevention…
27866 runs0 likes0 downloads0 reach0 impact
797 instances - 5 features - 6 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
886 runs0 likes0 downloads0 reach0 impact
264 instances - 5 features - 2 classes - 0 missing values
Information about the dataset CLASSTYPE: numeric CLASSINDEX: last
2 runs0 likes0 downloads0 reach0 impact
559 instances - 5 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach0 impact
48 instances - 5 features - 0 classes - 0 missing values
This dataset is synthetic. It was generated by David Coleman at RCA Laboratories in Princeton, N.J. For convenience, we will refer to it as the POLLEN DATA. The first three variables are the lengths…
0 runs0 likes0 downloads0 reach0 impact
3848 instances - 5 features - 0 classes - 0 missing values
Information about the dataset CLASSTYPE: numeric CLASSINDEX: last
2 runs0 likes0 downloads0 reach0 impact
559 instances - 5 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Gasoline consumption is being treated as…
2 runs0 likes0 downloads0 reach0 impact
27 instances - 5 features - 0 classes - 0 missing values
DATA-SETS FROM DIGGLE, P.J. (1990). TIME SERIES : A BIOSTATISTICAL INTRODUCTION. Oxford University Press. Table: Table A1 Luteinizing hormone Information about the dataset CLASSTYPE: numeric…
0 runs0 likes0 downloads0 reach0 impact
48 instances - 5 features - 0 classes - 0 missing values
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
2 runs0 likes0 downloads0 reach0 impact
8641 instances - 5 features - 0 classes - 0 missing values
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
0 runs0 likes0 downloads0 reach0 impact
323 instances - 5 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
709 runs0 likes0 downloads0 reach0 impact
48 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
1139 runs0 likes0 downloads0 reach0 impact
132 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
1136 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 2 classes - 0 missing values
Hayes-Roth Database This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. Source…
384 runs0 likes0 downloads0 reach0 impact
160 instances - 5 features - 3 classes - 0 missing values
Data originating from the book "Analyzing Categorical Data" by Jeffrey S. Simonoff.
1087 runs0 likes0 downloads0 reach0 impact
50 instances - 5 features - 2 classes - 0 missing values
A shar archive of data from the book Data Analysis: An Introduction (1992), Prentice Hall, by Jeff Witmer. Submitted by Jeff Witmer (fwitmer@ocvaxa.cc.oberlin.edu) [28/Jun/94] (29 kbytes) Note:…
2 runs0 likes0 downloads0 reach0 impact
50 instances - 5 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
773 runs0 likes0 downloads0 reach0 impact
8641 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
842 runs0 likes0 downloads0 reach0 impact
323 instances - 5 features - 2 classes - 0 missing values
* Dataset Title: Wall-Following Robot Navigation Data Data Set (version with 4 Attributes) * Abstract: The data were collected as the SCITOS G5 robot navigates through the room following the wall in a…
138 runs0 likes0 downloads0 reach0 impact
5456 instances - 5 features - 4 classes - 0 missing values
This data set was generated to model psychological experimental results. Each example is classified as having the balance scale tip to the right, tip to the left, or be balanced. The attributes are…
30115 runs0 likes0 downloads0 reach0 impact
625 instances - 5 features - 3 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1043 runs0 likes0 downloads0 reach0 impact
125 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
707 runs0 likes0 downloads0 reach0 impact
96 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
752 runs0 likes0 downloads0 reach0 impact
48 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
779 runs0 likes0 downloads0 reach0 impact
559 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
774 runs0 likes0 downloads0 reach0 impact
559 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
812 runs0 likes0 downloads0 reach0 impact
559 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
769 runs0 likes0 downloads0 reach0 impact
559 instances - 5 features - 2 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs0 likes0 downloads0 reach0 impact
150 instances - 5 features - 0 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
3 runs0 likes0 downloads0 reach0 impact
50 instances - 5 features - 0 classes - 0 missing values
The dataset collects data from an Android smartphone positioned in the chest pocket. Accelerometer Data are collected from 22 participants walking in the wild over a predefined path. The dataset is…
80 runs0 likes0 downloads0 reach0 impact
149332 instances - 5 features - 22 classes - 0 missing values
Identifier attribute deleted. NAME: Sexual activity and the lifespan of male fruitflies TYPE: Designed (almost factorial)…
4 runs0 likes0 downloads0 reach0 impact
125 instances - 5 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Points scored per minute is being treated as…
2 runs0 likes0 downloads0 reach0 impact
96 instances - 5 features - 0 classes - 0 missing values