Data
#### Information A small classic dataset from Fisher, 1936. One of the earliest datasets used for the evaluation of classification methodologies. #### References * Fisher, R. A. (1936), The use of…
0 runs - 0 likes - 1 downloads - 1 reach - 0 impact
150 instances - 7 features - classes - 0 missing values
leak detection file
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
23 instances - 4 features - classes - 0 missing values
Context This dataset was collected by Neha Prerna Tigga and Dr. Shruti Garg of the Department of Computer Science and Engineering, BIT Mesra, Ranchi-835215 for research, non-commercial purposes only.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
952 instances - 18 features - classes - 48 missing values
Context This dataset was collected by Neha Prerna Tigga and Dr. Shruti Garg of the Department of Computer Science and Engineering, BIT Mesra, Ranchi-835215 for research, non-commercial purposes only. An article is also published implementing this dataset. For more information and citation of this dataset please refer: Tigga, N. P., & Garg, S. (2020). Prediction of Type 2 Diabetes using Machine Learning Classification Methods. Procedia Computer Science, 167, 706-716. DOI: https://doi.org/10.1016/j.procs.2020.03.336 Content There is a total of 952 instances with 17 independent predictor variables and one binary target or dependent variable, Diabetes. Acknowledgements We would like to thank all the participants who contributed towards the building of this dataset. Inspiration To build a machine learning algorithm to predict if a person has diabetes or not…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
952 instances - 18 features - classes - 48 missing values
Description This is a countrywide weather events dataset that includes 6.3 million events, and covers 49 states of the United States. Examples of weather events are rain, snow, storm, and freezing…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7479165 instances - 14 features - classes - 73797 missing values
Context Towards Data Science Inc. is a corporation registered in Canada as a platform for thousands of people to exchange ideas and to expand understanding of data science. This dataset contains data…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
46079 instances - 8 features - classes - 3 missing values
Context Japanese animation, which is known as anime, has become internationally widespread nowadays. This dataset provides data on anime taken from Anime News Network. Content This dataset consists of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1563 instances - 20 features - classes - 0 missing values
Context The data obtained from Mexico's General Direction of Epidemiology contain a wide range of information on the current pandemic situation. However, these data are saturated with features that may…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
92320 instances - 7 features - classes - 0 missing values
Statistics on a Blockbreaker-like Game The author is in the process of creating a blockbreaker-like game, in which the jumping-off point is the "Block Breaker" section of the Udemy course, Complete C…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
6814 instances - 7 features - classes - 0 missing values
Context COVID-19 is spreading across the globe and this data set helps in analyzing to what extent the pandemic has affected different countries. Content This data set contains the total number of COVID…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
26355 instances - 6 features - classes - 844 missing values
Context Last time I built an LSTM price prediction for Corn, but the result was not satisfactory. I would like to try another algorithm and other data, so I decided to use the Wheat price for the exercise. And…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2272 instances - 5 features - classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
53940 instances - 7 features - 0 classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
bnlearn Bayesian Network Repository reference: [URL](https://www.bnlearn.com/bnrepository/discrete-medium.html#water) - Number of nodes: 32 - Number of arcs: 66 - Number of parameters: 10083 - Average…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 32 features - classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
1187 runs - 0 likes - 0 downloads - 0 reach - 0 impact
412 instances - 9 features - 7 classes - 96 missing values
This is perhaps the best known database to be found in the pattern recognition literature. Fisher's paper is a classic in the field and is referenced frequently to this day. (See Duda & Hart, for…
7928 runs - 0 likes - 0 downloads - 0 reach - 0 impact
150 instances - 5 features - 3 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
117 runs - 0 likes - 0 downloads - 0 reach - 0 impact
50 instances - 5 features - 2 classes - 0 missing values
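The mean-threshold binarization mentioned in the description above can be sketched roughly as follows. This is only an illustration: the description is truncated, so which side of the mean receives which label is not stated here. The function name `binarize_by_mean` and both label strings are hypothetical placeholders, not names used by the dataset.

```python
from statistics import mean


def binarize_by_mean(targets, low_label="below_mean", high_label="at_or_above_mean"):
    """Map each numeric target to one of two nominal labels.

    The threshold is the arithmetic mean of all targets. The label names
    and the handling of values equal to the mean are assumptions made for
    this sketch, because the source description is cut off.
    """
    threshold = mean(targets)
    return [low_label if t < threshold else high_label for t in targets]


# Toy numeric target column (made-up values, mean is 4.0).
labels = binarize_by_mean([1.0, 2.0, 3.0, 10.0])
print(labels)
```

Passing the two labels as parameters keeps the sketch neutral about the convention the actual dataset uses.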
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
104 runs - 0 likes - 0 downloads - 0 reach - 0 impact
57 instances - 11 features - 2 classes - 1 missing values
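The majority-class re-labeling mentioned in the description above might look like the sketch below. The 'P' label comes from the description; the 'N' label for all remaining classes is an assumption, since the text is truncated at that point. The function name `binarize_majority` is hypothetical.

```python
from collections import Counter


def binarize_majority(labels, positive="P", negative="N"):
    """Re-label the most frequent class as `positive` and all others as `negative`.

    `negative="N"` is an assumed default; the source description is cut off
    before naming it. On ties, Counter.most_common keeps the class that
    was encountered first.
    """
    majority, _count = Counter(labels).most_common(1)[0]
    return [positive if y == majority else negative for y in labels]


# Toy multi-class target column (made-up values).
binary = binarize_majority(["a", "b", "a", "c", "a"])
print(binary)
```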
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
777 runs - 0 likes - 0 downloads - 0 reach - 0 impact
625 instances - 5 features - 2 classes - 0 missing values
Iris DataSet
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
150 instances - 5 features - 3 classes - 0 missing values
This dataset contains a simulation of the Lorenz attractor with the parameter $\rho$ varying in time. The stable and chaotic regimes alternate.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
4942 instances - 4 features - classes - 0 missing values
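A simulation of this kind could be produced along the lines of the sketch below, which integrates the Lorenz equations with a time-dependent $\rho$. Everything specific here is an assumption made for illustration: the classic values $\sigma = 10$ and $\beta = 8/3$, the RK4 integrator, the step size, and the hypothetical `alternating_rho` schedule that switches between a stable value (14) and a chaotic one (28). The listing does not say how the dataset itself was generated.

```python
import math


def simulate_lorenz(rho_of_t, n_steps=2000, dt=0.01, sigma=10.0,
                    beta=8.0 / 3.0, state=(1.0, 1.0, 1.0)):
    """Integrate the Lorenz system with classical RK4.

    `rho_of_t` is a callable giving rho at time t, so the parameter
    can vary while the trajectory evolves.
    """
    def deriv(t, s):
        x, y, z = s
        return (sigma * (y - x), x * (rho_of_t(t) - z) - y, x * y - beta * z)

    def shift(s, k, h):
        # Return s + h * k, component-wise.
        return tuple(si + h * ki for si, ki in zip(s, k))

    trajectory = [state]
    t = 0.0
    for _ in range(n_steps):
        k1 = deriv(t, state)
        k2 = deriv(t + dt / 2, shift(state, k1, dt / 2))
        k3 = deriv(t + dt / 2, shift(state, k2, dt / 2))
        k4 = deriv(t + dt, shift(state, k3, dt))
        state = tuple(
            s + dt / 6 * (a + 2 * b + 2 * c + d)
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)
        )
        t += dt
        trajectory.append(state)
    return trajectory


def alternating_rho(t, period=10.0, stable=14.0, chaotic=28.0):
    """Hypothetical schedule: switch rho every `period` time units."""
    return stable if int(t // period) % 2 == 0 else chaotic


trajectory = simulate_lorenz(alternating_rho)
print(len(trajectory), trajectory[-1])
```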
iris with ignored features Sepal.Width and Petal.Length
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
150 instances - 5 features - 3 classes - 0 missing values
Palmer Penguins Dataset The goal of palmerpenguins is to provide a great dataset for data exploration and visualization, as an alternative to iris. About the data Data were collected and made available by…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
344 instances - 7 features - classes - 19 missing values
Context New Zealand lies on a fault line that runs through its spine. This fault line, known as the Alpine Fault, is very active and forms part of the "Ring of Fire". Content This is a list of all the earth…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
20648 instances - 5 features - classes - 0 missing values
Content This dataset contains information about Netflix movies, and how the Netflix website uses it to give me recommendations about movies of the genre that I prefer. Here, I used Netflix's API…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3323 instances - 10 features - classes - 0 missing values
This dataset is something I found online when I wanted to practice regression models. It is openly available online in multiple places, though I do not know the exact origin and collection…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1338 instances - 7 features - classes - 0 missing values
Context The dataset provides user reviews on specific drugs along with related conditions, side effects, age, sex, and ratings reflecting overall patient satisfaction. Content Data was acquired by…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
362806 instances - 11 features - classes - 42 missing values
Context Winter Storm Uri in February 2021 caused havoc across the United States and specifically to Texas involving mass power outages, water and food shortages, and dangerous weather conditions. This…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
23358 instances - 14 features - classes - 53699 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
53940 instances - 7 features - 0 classes - 0 missing values
This is perhaps the best known database to be found in the pattern recognition literature. Fisher's paper is a classic in the field and is referenced frequently to this day. (See Duda & Hart, for…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
149 instances - 5 features - 3 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
163065 instances - 4 features - 0 classes - 0 missing values
a
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
150 instances - 5 features - classes - 0 missing values
The Three Gorges Dam is a hydroelectric gravity dam on the Yangtze River, near the city of Yichang, Hubei province, China. It has a total water storage capacity of 39.3 billion cubic metres. It is…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3667 instances - 5 features - classes - 525 missing values
Author: Volker Lohweg (University of Applied Sciences, Ostwestfalen-Lippe) Source: [UCI](https://archive.ics.uci.edu/ml/datasets/banknote+authentication) - 2012 Please cite:…
138170 runs - 6 likes - 40 downloads - 46 reach - 34 impact
1372 instances - 5 features - 2 classes - 0 missing values
test
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
578 instances - 4 features - classes - 0 missing values
Description: The Electric_Vehicle_Population_Data.csv dataset provides a comprehensive overview of the electric vehicle (EV) population within a specific region, highlighting key information about the…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
181458 instances - 17 features - classes - 421 missing values
Intra
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
204 instances - 22 features - classes - 1473 missing values
Intra
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
204 instances - 22 features - classes - 1473 missing values
Intra
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
204 instances - 22 features - classes - 1473 missing values
Data taken from the Blood Transfusion Service Center in Hsin-Chu City in Taiwan -- this is a classification problem. To demonstrate the RFMTC marketing model (a modified version of RFM), this study…
468690 runs - 6 likes - 101 downloads - 107 reach - 46 impact
748 instances - 5 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on both numerical and categorical…
1 runs - 0 likes - 0 downloads - 0 reach - 0 impact
163065 instances - 4 features - 0 classes - 0 missing values
Airlines Dataset Inspired by the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
539383 instances - 8 features - 0 classes - 0 missing values
Description: The dataset games.csv is a comprehensive collection of chess game records, providing detailed insights into game outcomes, player ratings, moves, and opening strategies. It consists of…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
20058 instances - 16 features - classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
119 runs - 0 likes - 0 downloads - 0 reach - 0 impact
50 instances - 6 features - 2 classes - 0 missing values
Data file: This data from "Problem-Solving" on "backache in pregnancy" is in somewhat different format from that listed in the book. Each integer is preceded by a space. This makes it easier to read.…
174 runs - 0 likes - 0 downloads - 0 reach - 0 impact
180 instances - 32 features - 2 classes - 0 missing values
PRO FOOTBALL SCORES (raw data appears after the description below) How well do the oddsmakers of Las Vegas predict the outcome of professional football games? Is there really a home field advantage -…
16080 runs - 0 likes - 0 downloads - 0 reach - 0 impact
672 instances - 10 features - 2 classes - 1200 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
18 runs - 0 likes - 0 downloads - 0 reach - 0 impact
159 instances - 10 features - 0 classes - 6 missing values
This dataset is synthetic. It was generated by David Coleman at RCA Laboratories in Princeton, N.J. For convenience, we will refer to it as the POLLEN DATA. The first three variables are the lengths…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3848 instances - 5 features - 0 classes - 0 missing values
This analysis describes and summarizes the relationships between 1987 salaries of major league baseball players and the player's performance. The salary data were taken from Sports Illustrated, April…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
26 instances - 8 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
53 runs - 0 likes - 0 downloads - 0 reach - 0 impact
92 instances - 6 features - 0 classes - 26 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
60 instances - 11 features - 0 classes - 14 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Gasoline consumption is being treated as…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
27 instances - 5 features - 0 classes - 0 missing values
DATA-SETS FROM DIGGLE, P.J. (1990). TIME SERIES : A BIOSTATISTICAL INTRODUCTION. Oxford University Press. Table: Table A1 Luteinizing hormone Information about the dataset CLASSTYPE: numeric…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
48 instances - 5 features - 0 classes - 0 missing values
Contains 110 data sets from the book 'The Statistical Sleuth' by Fred Ramsey and Dan Schafer; Duxbury Press, 1997. (schafer@stat.orst.edu) [14/Oct/97] (172k) Note: description taken from this web…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
93 instances - 7 features - 0 classes - 0 missing values
This S dump contains 22 data sets from the book Visualizing Data published by Hobart Press (books@hobart.com). The dump was created by data.dump() and can be read back into S by data.restore(). The…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
323 instances - 5 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
672 runs - 0 likes - 0 downloads - 0 reach - 0 impact
158 instances - 8 features - 2 classes - 87 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
723 runs - 0 likes - 0 downloads - 0 reach - 0 impact
34 instances - 9 features - 2 classes - 0 missing values
A shar archive of data from the book Data Analysis: An Introduction (1992), Prentice Hall, by Jeff Witmer. Submitted by Jeff Witmer (fwitmer@ocvaxa.cc.oberlin.edu) [28/Jun/94] (29 kbytes) Note:…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
50 instances - 5 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
853 runs - 0 likes - 0 downloads - 0 reach - 0 impact
250 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1173 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
847 runs - 0 likes - 0 downloads - 0 reach - 0 impact
250 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
624 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
789 runs - 0 likes - 0 downloads - 0 reach - 0 impact
73 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
780 runs - 0 likes - 0 downloads - 0 reach - 0 impact
66 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
786 runs - 0 likes - 0 downloads - 0 reach - 0 impact
500 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1111 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
854 runs - 0 likes - 0 downloads - 0 reach - 0 impact
250 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1119 runs - 0 likes - 0 downloads - 0 reach - 0 impact
100 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
866 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7129 instances - 6 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
598 runs - 0 likes - 0 downloads - 0 reach - 0 impact
1000 instances - 6 features - 2 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistical methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
150 instances - 5 features - 0 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistical methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
3 runs - 0 likes - 0 downloads - 0 reach - 0 impact
50 instances - 5 features - 0 classes - 0 missing values
* Title: User Knowledge Modeling Data Set * Abstract: It is the real dataset about the students' knowledge status about the subject of Electrical DC Machines. The dataset had been obtained from Ph.D.…
153 runs - 0 likes - 0 downloads - 0 reach - 0 impact
403 instances - 6 features - 5 classes - 0 missing values
Identifier attribute deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
398 instances - 8 features - 0 classes - 6 missing values
Dataset from Smoothing Methods in Statistics (ftp stat.cmu.edu/datasets) Simonoff, J.S. (1996). Smoothing Methods in Statistics. New York: Springer-Verlag. Points scored per minute is being treated as…
2 runs - 0 likes - 0 downloads - 0 reach - 0 impact
96 instances - 5 features - 0 classes - 0 missing values
libSVM, AAD group. A practical guide to support vector classification. Technical report, Department of Computer Science, National Taiwan University, 2003. #Dataset from the LIBSVM data repository…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
7089 instances - 5 features - 0 classes - 0 missing values
Modified version of the training dataset of the Bike Sharing Demand challenge running on Kaggle (http://www.kaggle.com/c/bike-sharing-demand/) If you use the problem in publication, please cite:…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10886 instances - 11 features - 0 classes - 0 missing values
Dataset Title: Localization Data for Person Activity Data Set Abstract: Data contains recordings of five people performing different activities. Each person wore four sensors (tags) while performing…
6 runs - 0 likes - 0 downloads - 0 reach - 0 impact
164860 instances - 8 features - 11 classes - 0 missing values
This is the same data as version 5 (OpenML ID = 1220) with '_id' features coded as nominal factor variables.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
39948 instances - 12 features - 2 classes - 0 missing values
simple engine data
52 runs - 0 likes - 0 downloads - 0 reach - 0 impact
383 instances - 6 features - 3 classes - 0 missing values
Content Microsoft is an American multinational technology company. It develops, manufactures, licenses, supports, and sells computer software, consumer electronics, personal computers, and related…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
5000 instances - 6 features - classes - 0 missing values
Context Google PlayStore App analytics. (1.1 Million + App Data) Source: https://github.com/gauthamp10/Google-Playstore-Dataset Content I've collected the data with the help of Python and Scrapy…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2312944 instances - 24 features - classes - 1355506 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3172 instances - 6 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on numerical features" benchmark.…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3172 instances - 6 features - 2 classes - 0 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "regression on numerical features" benchmark. Original…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3172 instances - 6 features - 2 classes - 0 missing values
Subsampling of the dataset Click_prediction_small (42733) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 12 features - 2 classes - 0 missing values
Subsampling of the dataset Click_prediction_small (42733) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 12 features - 2 classes - 0 missing values
Subsampling of the dataset Click_prediction_small (42733) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 12 features - 2 classes - 0 missing values
Subsampling of the dataset Click_prediction_small (42733) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 12 features - 2 classes - 0 missing values
Subsampling of the dataset Click_prediction_small (42733) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
2000 instances - 12 features - 2 classes - 0 missing values
Thyroid disease records supplied by the Garavan Institute and J. Ross Quinlan, New South Wales Institute, Sydney, Australia. 1987.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
3103 instances - 23 features - 2 classes - 0 missing values
This data set contains details of a bank's customers, and the target variable is a binary variable indicating whether the customer left the bank (closed their account) or continues to be a…
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
10000 instances - 11 features - 0 classes - 0 missing values
Data has been taken from various sources such as data gov and various other websites and has been preprocessed for analysis purposes.
0 runs - 0 likes - 0 downloads - 0 reach - 0 impact
204 instances - 5 features - classes - 0 missing values