Data
analcatdata_happiness-pmlb
31 runs0 likes0 downloads0 reach0 impact
60 instances - 4 features - 3 classes - 0 missing values
No data.
291 runs0 likes0 downloads0 reach0 impact
1000000 instances - 17 features - 7 classes - 0 missing values
No data.
309 runs0 likes0 downloads0 reach0 impact
1000000 instances - 35 features - 6 classes - 0 missing values
Meta-Album DIBaS Dataset (Extended): The Digital Images of Bacteria Species dataset (DIBaS) (https://github.com/gallardorafael/DIBaS-Dataset) is a dataset of 33 bacterial species with around 20…
0 runs0 likes0 downloads0 reach1 impact
4060 instances - 3 features - 33 classes - 4060 missing values
Description: The "Thyroid_Diff.csv" dataset is a comprehensive collection of clinical data relating to thyroid diseases. With attributes capturing a wide range of information from patient demographics…
0 runs0 likes0 downloads0 reach0 impact
383 instances - 17 features - classes - 0 missing values
Tiny ImageNet contains 100000 images of 200 classes (500 for each class) downsized to 64 x 64 colored images. Note: this dataset only links to 20 images per class (instead of the usual 500) and is ONLY…
0 runs0 likes0 downloads0 reach0 impact
4000 instances - 3 features - 0 classes - 0 missing values
Tiny ImageNet contains 100000 images of 200 classes (500 for each class) downsized to 64 x 64 colored images. Each class has 500 training images, 50 validation images, and 50 test images. The dataset…
0 runs0 likes0 downloads0 reach0 impact
100000 instances - 3 features - 0 classes - 0 missing values
For detecting fraudulent transactions
0 runs0 likes0 downloads0 reach0 impact
100 instances - 8 features - classes - 0 missing values
A simple database containing 17 Boolean-valued attributes describing animals. The "type" attribute appears to be the class attribute. Notes: * I find it unusual that there are 2 instances of "frog"…
191 runs4 likes22 downloads26 reach9 impact
101 instances - 17 features - 7 classes - 0 missing values
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
899 runs0 likes8 downloads8 reach14 impact
130 instances - 3 features - 5 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
3 instances - 2 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
raw tweets on ChatGPT
0 runs0 likes0 downloads0 reach0 impact
219294 instances - 3 features - classes - 0 missing values
Description: The `olist_customers_dataset.csv` offers a comprehensive snapshot of customer details from the Olist e-commerce platform. This dataset encapsulates essential customer information,…
0 runs0 likes0 downloads0 reach0 impact
99441 instances - 5 features - classes - 0 missing values
Description: The 'heartrate_seconds_merged.csv' dataset is a comprehensive compilation of heart rate measurements collected over varying days and times, intended to provide insights into heart rate…
0 runs0 likes0 downloads0 reach0 impact
1154681 instances - 3 features - classes - 0 missing values
The dataset 'minuteStepsNarrow_merged.csv' provides detailed insights into step activity recorded for various users, identified by their unique IDs, over specific minutes of different days. The data…
0 runs0 likes0 downloads0 reach0 impact
1445040 instances - 3 features - classes - 0 missing values
car data
0 runs0 likes0 downloads0 reach0 impact
22 instances - 3 features - classes - 0 missing values
car data
0 runs0 likes0 downloads0 reach0 impact
22 instances - 3 features - classes - 0 missing values
car data
0 runs0 likes0 downloads0 reach0 impact
22 instances - 3 features - classes - 0 missing values
car data
0 runs0 likes0 downloads0 reach0 impact
22 instances - 3 features - classes - 0 missing values
From original source: ----- As part of our continued commitment towards the WWW community and its success, we are again making available the entire data sets for this set of surveys. This enables…
0 runs0 likes0 downloads0 reach0 impact
10108 instances - 69 features - 2 classes - 309 missing values
car data
0 runs0 likes0 downloads0 reach0 impact
22 instances - 3 features - classes - 0 missing values
analcatdata: A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
581 runs0 likes5 downloads5 reach14 impact
400 instances - 6 features - 4 classes - 0 missing values
Upstream data from the twin Archimedes screw hydro-electric generator on the river Thames at Caversham weir, Reading, UK.
0 runs0 likes0 downloads0 reach0 impact
235 instances - 2 features - 0 classes - 0 missing values
Context: This dataset was created by our in-house Web Scraping and Data Mining teams at PromptCloud and DataStock. You can download the full dataset here. This sample contains 30K records. Content…
0 runs0 likes0 downloads0 reach0 impact
30000 instances - 16 features - classes - 49458 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
2 runs0 likes0 downloads0 reach0 impact
108 instances - 4 features - 0 classes - 0 missing values
analcatdata: A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
450 instances - 4 features - 0 classes - 0 missing values
analcatdata: A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
475 instances - 4 features - 0 classes - 0 missing values
analcatdata: A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
475 instances - 4 features - 0 classes - 0 missing values
17x17x2x2 tables of counts in GLIM-ready format used for the analyses in Biblarz, Timothy J., and Adrian E. Raftery. 1993. "The Effects of Family Disruption on Social Mobility." American Sociological…
6 runs0 likes0 downloads0 reach0 impact
1156 instances - 6 features - 0 classes - 0 missing values
Data on educational transitions for a sample of 500 Irish schoolchildren aged 11 in 1967. The data were collected by Greaney and Kelleghan (1984), and reanalyzed by Raftery and Hout (1985, 1993). ###…
16174 runs0 likes0 downloads0 reach0 impact
500 instances - 6 features - 2 classes - 32 missing values
Datasets for 'Pattern Recognition and Neural Networks' by B.D. Ripley. Cambridge University Press (1996) ISBN 0-521-46086-7. The…
41 runs0 likes0 downloads0 reach0 impact
27 instances - 3 features - 4 classes - 0 missing values
analcatdata: A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach0 impact
366 instances - 5 features - classes - 2 missing values
Dataset from 'Pattern Recognition and Neural Networks' by B.D. Ripley. Cambridge University Press (1996) ISBN 0-521-46086-7. The background to the datasets is described in section 1.4; this file…
1105 runs0 likes0 downloads0 reach0 impact
250 instances - 3 features - 2 classes - 0 missing values
analcatdata: A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach0 impact
100 instances - 10 features - classes - 0 missing values
analcatdata: A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
100 instances - 3 features - 0 classes - 0 missing values
Human Development Index [DATA] United Nations Development Program compiled an Index of Human Development. Column 1: Country(character) 2: Index 3: GNP GNP PER CAPITA RANK RANK - RANK HDI 1987 GNP RANK…
2 runs0 likes0 downloads0 reach0 impact
130 instances - 2 features - 0 classes - 0 missing values
The data consist of 2001 observations taken from a balloon about 30 kilometres above the surface of the earth. In the section of the flight shown here the balloon increases in height. As radiation…
0 runs0 likes0 downloads0 reach0 impact
2001 instances - 2 features - 0 classes - 0 missing values
analcatdata: A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
649 instances - 3 features - 0 classes - 0 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Electricity usage is being treated as the…
4 runs0 likes0 downloads0 reach0 impact
55 instances - 3 features - 0 classes - 0 missing values
Tumor-size treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs0 likes0 downloads0 reach0 impact
286 instances - 10 features - 0 classes - 9 missing values
Data Sets for 'Regression Models for Time Series Analysis' by B. Kedem and K. Fokianos, Wiley 2002. Submitted by Kostas Fokianos (fokianos@ucy.ac.cy) [8/Nov/02] (176k) Note: - attribute names were…
2 runs0 likes0 downloads0 reach0 impact
264 instances - 3 features - 0 classes - 0 missing values
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach0 impact
50 instances - 3 features - classes - 0 missing values
Data originating from the book "Analyzing Categorical Data" by Jeffrey S. Simonoff.
1087 runs0 likes0 downloads0 reach0 impact
50 instances - 5 features - 2 classes - 0 missing values
Grass Grubs and Damage Ranking Data source: R. J. Townsend AgResearch, Lincoln, New Zealand Grass grubs are one of the major insect pests of pasture in Canterbury and can cause severe pasture damage…
988 runs0 likes0 downloads0 reach0 impact
155 instances - 9 features - 4 classes - 0 missing values
Data Sets for 'Regression Models for Time Series Analysis' by B. Kedem and K. Fokianos, Wiley 2002. Submitted by Kostas Fokianos (fokianos@ucy.ac.cy) [8/Nov/02] (176k) Note: - attribute names were…
599 runs0 likes0 downloads0 reach0 impact
1024 instances - 3 features - 4 classes - 0 missing values
File README: chscase. A collection of the data sets used in the book "A Casebook for a First Course in Statistics and Data Analysis," by Samprit Chatterjee, Mark S. Handcock and Jeffrey S.…
0 runs0 likes0 downloads0 reach0 impact
185 instances - 2 features - classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
123 runs0 likes0 downloads0 reach0 impact
46 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1088 runs0 likes0 downloads0 reach0 impact
132 instances - 4 features - 2 classes - 0 missing values
An artificial data set where instances belong to several clusters with a banana shape. There are two attributes At1 and At2 corresponding to the x and y axis, respectively. The class label (-1 and 1)…
163 runs0 likes0 downloads0 reach0 impact
5300 instances - 3 features - 2 classes - 0 missing values
* Dataset Title: Wall-Following Robot Navigation Data Data Set (version with 2 Attributes) * Abstract: The data were collected as the SCITOS G5 robot navigates through the room following the wall in a…
109 runs0 likes0 downloads0 reach0 impact
5456 instances - 3 features - 4 classes - 0 missing values
ARFF version of UCI dataset 'flags'. Creators: Collected primarily from the "Collins Gem Guide to Flags": Collins Publishers (1986). Donor: Richard S. Forsyth. Date 5/15/1990 This data file contains…
108 runs0 likes0 downloads0 reach0 impact
194 instances - 29 features - 8 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
119 runs0 likes0 downloads0 reach0 impact
39 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1164 runs0 likes0 downloads0 reach0 impact
222 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1038 runs0 likes0 downloads0 reach0 impact
147 instances - 7 features - 2 classes - 0 missing values
This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage…
2 runs0 likes0 downloads0 reach0 impact
60 instances - 17 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
771 runs0 likes0 downloads0 reach0 impact
468 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1043 runs0 likes0 downloads0 reach0 impact
125 instances - 5 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
111 runs0 likes0 downloads0 reach0 impact
52 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
903 runs0 likes0 downloads0 reach0 impact
468 instances - 3 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
767 runs0 likes0 downloads0 reach0 impact
189 instances - 10 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
119 runs0 likes0 downloads0 reach0 impact
50 instances - 4 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
762 runs0 likes0 downloads0 reach0 impact
88 instances - 3 features - 2 classes - 0 missing values
Dataset from the MLRR repository: http://axon.cs.byu.edu:5000/
731 runs0 likes0 downloads0 reach0 impact
151 instances - 7 features - 3 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
116640 instances - 10 features - 0 classes - 0 missing values
Case number deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning…
10 runs0 likes0 downloads0 reach0 impact
195 instances - 11 features - 0 classes - 2 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
140 runs0 likes0 downloads0 reach0 impact
194 instances - 29 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
842 runs0 likes0 downloads0 reach0 impact
155 instances - 9 features - 2 classes - 0 missing values
Abstract: The data set is composed of 60 chorales (5665 events) by J.S. Bach (1685-1750). Each event of each chorale is labelled using 1 among 101 chord labels and described through 14 features.…
31 runs0 likes0 downloads0 reach0 impact
5665 instances - 17 features - 102 classes - 0 missing values
User profile data for San Francisco OkCupid users published in [Kim, A. Y., & Escobedo-Land, A. (2015). OKCupid data for introductory statistics and data science courses. Journal of Statistics…
0 runs0 likes0 downloads0 reach0 impact
50789 instances - 20 features - 3 classes - 154107 missing values
The dataset freMTPL2sev contains claim amounts for 26,639 motor third-party liability policies.
0 runs0 likes0 downloads0 reach0 impact
26639 instances - 2 features - classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
Historical Rainfall data of Bangladesh
0 runs0 likes0 downloads0 reach0 impact
16755 instances - 4 features - 0 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
Major changes w.r.t. version 2: variable 3 is ignored in this upload because it appears to be a perfect predictor. Tamilnadu Electricity Board Hourly Readings dataset. Real-time readings were collected…
0 runs0 likes2 downloads2 reach19 impact
45781 instances - 4 features - 20 classes - 0 missing values
This file holds global land temperatures by country
0 runs0 likes0 downloads0 reach0 impact
577462 instances - 4 features - classes - 64563 missing values
Holds information on average temperature per country
0 runs0 likes0 downloads0 reach0 impact
577462 instances - 4 features - classes - 64563 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
Context: We publish the data to clarify the real evolution of forest area in post-communist Romania. The data is from the National Statistics Institute of Romania, so these are the official reported…
0 runs0 likes0 downloads0 reach0 impact
8111 instances - 4 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs0 likes0 downloads0 reach0 impact
10578 instances - 8 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
The weather problem is a tiny dataset that we will use repeatedly to illustrate machine learning methods. Entirely fictitious, it supposedly concerns the conditions that are suitable for playing some…
0 runs0 likes0 downloads0 reach0 impact
14 instances - 5 features - 2 classes - 0 missing values
Subsampling of the dataset okcupid-stem (42734) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 20 features - 3 classes - 5980 missing values
Subsampling of the dataset okcupid-stem (42734) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 20 features - 3 classes - 6117 missing values
Subsampling of the dataset okcupid-stem (42734) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 20 features - 3 classes - 6105 missing values
This dataset contains attributes of dresses and their recommendations according to their sales. Sales are monitored on alternate days. The attributes analyzed are: Recommendation,…
0 runs0 likes0 downloads0 reach0 impact
500 instances - 13 features - 0 classes - 0 missing values
User profile data for San Francisco OkCupid users published in [Kim, A. Y., & Escobedo-Land, A. (2015). OKCupid data for introductory statistics and data science courses. Journal of Statistics…
0 runs0 likes0 downloads0 reach0 impact
26677 instances - 14 features - 3 classes - 0 missing values
good
0 runs0 likes0 downloads0 reach0 impact
10 instances - 4 features - classes - 2 missing values
artificial with anomaly
0 runs0 likes0 downloads0 reach0 impact
4032 instances - 3 features - classes - 0 missing values
artificial with anomaly
0 runs0 likes0 downloads0 reach0 impact
4032 instances - 3 features - classes - 0 missing values