Author: Alen Shapiro Source: [UCI](https://archive.ics.uci.edu/ml/datasets/Chess+(King-Rook+vs.+King-Pawn)) Please cite: [UCI citation policy](https://archive.ics.uci.edu/ml/citation_policy.html) 1.…
274237 runs, 0 likes, 0 downloads, 0 reach, 0 impact
3196 instances - 37 features - 2 classes - 0 missing values
This database is a standardized version of the original audiology database (see audiology.* in this directory). The non-standard set of attributes has been converted to a standard set of attributes…
7303 runs, 0 likes, 0 downloads, 0 reach, 0 impact
226 instances - 70 features - 24 classes - 317 missing values
One of a set of 6 datasets describing features of handwritten numerals (0 - 9) extracted from a collection of Dutch utility maps. The maps were scanned in 8 bit grey value at density of 400dpi,…
26538 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 241 features - 10 classes - 0 missing values
1. Title: Postoperative Patient Data 2. Source Information: -- Creators: Sharon Summers, School of Nursing, University of Kansas Medical Center, Kansas City, KS 66160 Linda Woolery, School of Nursing,…
1758 runs, 0 likes, 0 downloads, 0 reach, 0 impact
90 instances - 9 features - 3 classes - 3 missing values
### Description This dataset describes mushrooms in terms of their physical characteristics. They are classified into: poisonous or edible. ### Source ``` (a) Origin: Mushroom records are drawn from…
16692 runs, 0 likes, 0 downloads, 0 reach, 0 impact
8124 instances - 23 features - 2 classes - 2480 missing values
1. Title: Nursery Database 2. Sources: (a) Creator: Vladislav Rajkovic et al. (13 experts) (b) Donors: Marko Bohanec (marko.bohanec@ijs.si) Blaz Zupan (blaz.zupan@ijs.si) (c) Date: June, 1997 3. Past…
2210 runs, 0 likes, 0 downloads, 0 reach, 0 impact
12960 instances - 9 features - 5 classes - 0 missing values
Citation Request: This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
2009 runs, 0 likes, 0 downloads, 0 reach, 0 impact
286 instances - 10 features - 2 classes - 9 missing values
No data.
70 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 28 features - 2 classes - 0 missing values
No data.
72 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
194 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
73 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 16 features - 2 classes - 0 missing values
No data.
87 runs, 0 likes, 0 downloads, 0 reach, 0 impact
295245 instances - 11 features - 5 classes - 0 missing values
No data.
68 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 21 features - 2 classes - 0 missing values
No data.
67 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 17 features - 10 classes - 0 missing values
This data set consists of the petal and sepal lengths of 3 different types of irises (Setosa, Versicolour, and Virginica), stored in a 150x4 numpy.ndarray
65 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 40 features - 2 classes - 0 missing values
No data.
66 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 35 features - 6 classes - 0 missing values
No data.
211 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 20 features - 7 classes - 0 missing values
No data.
66 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 39 features - 6 classes - 0 missing values
No data.
324 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 37 features - 2 classes - 0 missing values
No data.
71 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 17 features - 2 classes - 0 missing values
No data.
60 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 17 features - 26 classes - 0 missing values
No data.
63 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 26 features - 7 classes - 0 missing values
No data.
63 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
68 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 10 features - 2 classes - 0 missing values
No data.
48 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 77 features - 10 classes - 0 missing values
No data.
50 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 65 features - 10 classes - 0 missing values
No data.
67 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
66 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
51 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 48 features - 10 classes - 0 missing values
No data.
1038 runs, 0 likes, 0 downloads, 0 reach, 0 impact
55296 instances - 10 features - 3 classes - 0 missing values
No data.
326 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 23 features - 2 classes - 0 missing values
No data.
68 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 19 features - 4 classes - 0 missing values
No data.
69 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 20 features - 2 classes - 0 missing values
No data.
356 runs, 0 likes, 0 downloads, 0 reach, 0 impact
131072 instances - 17 features - 2 classes - 0 missing values
No data.
65 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 30 features - 4 classes - 0 missing values
No data.
230 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 35 features - 2 classes - 0 missing values
No data.
63 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 41 features - 3 classes - 0 missing values
No data.
65 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 18 features - 7 classes - 0 missing values
No data.
73 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 30 features - 2 classes - 0 missing values
No data.
50 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 61 features - 2 classes - 0 missing values
No data.
90 runs, 0 likes, 0 downloads, 0 reach, 0 impact
137781 instances - 10 features - 7 classes - 0 missing values
No data.
314 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 36 features - 19 classes - 0 missing values
No data.
219 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 58 features - 2 classes - 0 missing values
No data.
66 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
1457 runs, 0 likes, 0 downloads, 0 reach, 0 impact
39366 instances - 10 features - 2 classes - 0 missing values
No data.
66 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 14 features - 5 classes - 0 missing values
No data.
334 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 33 features - 2 classes - 0 missing values
No data.
70 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 14 features - 2 classes - 0 missing values
Subsampling of the dataset kr-vs-kp (3) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self, seed:…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 37 features - 2 classes - 0 missing values
Subsampling of the dataset Internet-Advertisements (40978) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 101 features - 2 classes - 0 missing values
Subsampling of the dataset Amazon_employee_access (4135) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 10 features - 2 classes - 0 missing values
Subsampling of the dataset kr-vs-kp (3) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self, seed:…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 37 features - 2 classes - 0 missing values
Subsampling of the dataset connect-4 (40668) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 43 features - 3 classes - 0 missing values
Subsampling of the dataset kr-vs-kp (3) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self, seed:…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 37 features - 2 classes - 0 missing values
Subsampling of the dataset Amazon_employee_access (4135) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 10 features - 2 classes - 0 missing values
Subsampling of the dataset PhishingWebsites (4534) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 31 features - 2 classes - 0 missing values
Subsampling of the dataset Amazon_employee_access (4135) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 10 features - 2 classes - 0 missing values
Subsampling of the dataset PhishingWebsites (4534) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 31 features - 2 classes - 0 missing values
Subsampling of the dataset connect-4 (40668) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 43 features - 3 classes - 0 missing values
Subsampling of the dataset PhishingWebsites (4534) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample(…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 31 features - 2 classes - 0 missing values
Subsampling of the dataset kr-vs-kp (3) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self, seed:…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 37 features - 2 classes - 0 missing values
Subsampling of the dataset connect-4 (40668) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 43 features - 3 classes - 0 missing values
Subsampling of the dataset dna (40670) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self, seed:…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 101 features - 3 classes - 0 missing values
Subsampling of the dataset dna (40670) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self, seed:…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2000 instances - 101 features - 3 classes - 0 missing values
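The subsampled entries above all share the same parameters (a seed, `nrows=2000`, `ncols=100`, `nclasses=10`, `no_stratify=True`), but the source code they quote is truncated in this listing. The following is only a hypothetical sketch of what such a subsampling step could look like, using the Python standard library. The function name `subsample_sketch`, its signature, and the order of the capping steps are assumptions and are not taken from the original code.

```python
import random
from collections import Counter


def subsample_sketch(rows, labels, seed, nrows=2000, ncols=100, nclasses=10):
    """Hypothetical sketch: cap classes, feature columns, and rows.

    Rows are drawn by plain random sampling (no stratification),
    mirroring the no_stratify=True setting mentioned in the listing.
    """
    rng = random.Random(seed)  # a fixed seed makes the subsample reproducible

    # Keep only instances belonging to the `nclasses` most frequent classes.
    keep = {c for c, _ in Counter(labels).most_common(nclasses)}
    idx = [i for i, y in enumerate(labels) if y in keep]

    # Cap the number of feature columns by sampling without replacement.
    n_features = len(rows[0]) if rows else 0
    cols = list(range(n_features))
    if n_features > ncols:
        cols = sorted(rng.sample(cols, ncols))

    # Cap the number of rows by sampling without replacement.
    if len(idx) > nrows:
        idx = sorted(rng.sample(idx, nrows))

    X = [[rows[i][j] for j in cols] for i in idx]
    y = [labels[i] for i in idx]
    return X, y
```

Under these assumptions, a dataset with more than 2000 rows and more than 100 feature columns would be reduced to 2000 rows and 100 feature columns, which matches the instance counts reported for the entries above (the listed feature counts appear to include the target column).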
## **Meta-Album RSICB Dataset (Extended)** RSICB128 dataset (https://github.com/lehaifeng/RSI-CB) covers 45 scene categories, assembling in total 36 000 images of resolution 128x128 px. The data…
0 runs, 0 likes, 0 downloads, 0 reach, 1 impact
36707 instances - 3 features - 45 classes - 0 missing values
## **Meta-Album RSICB Dataset (Mini)** RSICB128 dataset (https://github.com/lehaifeng/RSI-CB) covers 45 scene categories, assembling in total 36 000 images of resolution 128x128 px. The data authors…
0 runs, 0 likes, 0 downloads, 0 reach, 1 impact
1800 instances - 3 features - 45 classes - 0 missing values
## **Meta-Album RSICB Dataset (Micro)** RSICB128 dataset (https://github.com/lehaifeng/RSI-CB) covers 45 scene categories, assembling in total 36 000 images of resolution 128x128 px. The data authors…
0 runs, 0 likes, 0 downloads, 0 reach, 1 impact
800 instances - 3 features - 20 classes - 0 missing values
A Gold Standard of ~4,400 questions, answers, and comments from Stack Overflow, manually annotated for polarity. The dataset has been used for developing the EMTk toolkit for polarity detection from…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
3097 instances - 2 features - 3 classes - 0 missing values
A Gold Standard of ~4,400 questions, answers, and comments from Stack Overflow, manually annotated for polarity. The dataset has been used for developing the EMTk toolkit for polarity detection from…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1326 instances - 2 features - 3 classes - 0 missing values
A Gold Standard of ~4,400 questions, answers, and comments from Stack Overflow, manually annotated for polarity. The dataset has been used for developing the EMTk toolkit for polarity detection from…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
4423 instances - 2 features - 3 classes - 0 missing values
The ALARM ("A Logical Alarm Reduction Mechanism") is a Bayesian network designed to provide an alarm message system for patient monitoring. The alarm data set contains the following 37 variables: CVP…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
20000 instances - 37 features - classes - 0 missing values
24InternationalstudentsinChina-Continent
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
5 instances - 3 features - classes - 0 missing values
26InternationalstudentsinChina-Province
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
12 instances - 2 features - classes - 0 missing values
Context This dataset was a part of the assignment of my coursework. Content The dataset contains 90+ columns describing different aspects of all countries like GDP, Population, Electricity-consumption…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
197 instances - 80 features - classes - 0 missing values
This dataset has been scraped from Goodreads to obtain land information about the best books of the 19th Century. Feature Description Book_Name the title of the book Author_Name the author(s) of the…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1101 instances - 5 features - classes - 4 missing values
mini insect example dataset # 1
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
12 instances - 4 features - 4 classes - 0 missing values
Context I collected about 1200 Covid-19 research articles from the NCBI.NLM.NIH website to be utilized in ML algorithms/ Data Analysis such as Sentiment Analysis, Time Series, Recommender System…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1198 instances - 5 features - classes - 1459 missing values
Context The dataset comes from one of the most important parts of a mining process: a flotation plant. The main goal is to use this data to predict how much impurity is in the ore concentrate. As this…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
737453 instances - 24 features - classes - 0 missing values
Context Recent growing interest in cryptocurrencies, specifically as a speculative investment vehicle, has sparked global conversation over the past 12 months. Although this data is available across…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
28944 instances - 8 features - classes - 0 missing values
MINI - Birds dataset
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
500 instances - 3 features - 20 classes - 0 missing values
mini insect example dataset # 1
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
12 instances - 4 features - 4 classes - 0 missing values
MINI - Birds dataset
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
500 instances - 3 features - 20 classes - 0 missing values
mini insect example dataset # 1
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
12 instances - 4 features - 4 classes - 0 missing values
Context Throughout the world of data science, there are many languages and tools that can be used to complete a given task. While you are often able to use whichever tool you prefer, it is often…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
10153 instances - 4 features - classes - 9824 missing values
Context This dataset was scraped from http://www.asapsports.com/, using the code in this repository. I designed the webscraping code to account for most of the variance in the website's formatting,…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2096 instances - 6 features - classes - 0 missing values
Context I wanted to combine this dataset with Toronto Police Homicide reports to determine how time of day (and specifically illumination) affects crime. Content The dataset currently…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
4018 instances - 3 features - classes - 0 missing values
Content This is a dataset I started building for my future personal projects, as I think this kind of data is quite hard to acquire for free and in short time. I started acquiring data on March 21st,…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
193279 instances - 4 features - classes - 29954 missing values
Probable risk factors for coronary thrombosis, comprising data from 1841 men. The coronary data set contains the following 6 variables: Smoking (smoking): a two-level factor with levels no and yes. M.…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1841 instances - 6 features - classes - 0 missing values
Insurance is a network for evaluating car insurance risks. The insurance data set contains the following 27 variables: GoodStudent (good student): a two-level factor with levels False and True. Age…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
20000 instances - 27 features - classes - 0 missing values
A Gold Standard of ~4,400 questions, answers, and comments from Stack Overflow, manually annotated for polarity. The dataset has been used for developing the EMTk toolkit for polarity detection from…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
4423 instances - 2 features - 3 classes - 0 missing values
Context The dataset was generated through a long and complex process, starting with scraping all the URLs provided on Genius.com for the Game of Thrones series. Process of scraping and cleaning the dataset…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
23911 instances - 6 features - classes - 3 missing values
Context Everybody nowadays is mindful of what they eat. Counting calories and reducing fat intake is the number one advice given by all dieticians and nutritionists. Therefore, we need to know what…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
335 instances - 10 features - classes - 3 missing values
Context The objective of this dataset is to create a chess engine through machine learning. In this first part we will first predict the pieces to be moved depending on the position of the chessboard…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2632753 instances - 66 features - classes - 0 missing values
Context and Content The COVID-19 case surveillance system database includes individual-level data reported to U.S. states and autonomous reporting entities, including New York City and the District of…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
8405079 instances - 11 features - classes - 9543526 missing values
Dutch News Articles This dataset contains all the articles published by the NOS as of the 1st of January 2010. The data is obtained by scraping the NOS website. The NOS is one of the biggest (online)…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
237861 instances - 5 features - classes - 0 missing values
Data Description The dataset is from one of the leading travel sites and contains hotel reviews provided by customers. Variable Description User_ID unique ID of the customer Description description of the…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
38932 instances - 4 features - classes - 0 missing values
Context I was exploring League of Legends datasets to play around with, but since Riot allows only limited calls to their API, I collected the data from OP.GG. A few of my goals were to find out the best team…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
4028 instances - 6 features - classes - 0 missing values
Context The Dow Jones Industrial Average is one of the most followed stock market indexes by investors, financial professionals and the media. Content It measures the daily price movements of 30 large…
0 runs, 0 likes, 0 downloads, 0 reach, 0 impact
2766 instances - 7 features - classes - 0 missing values
Automated file upload of BNG(spambase)
98 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 58 features - 2 classes - 0 missing values
Automated file upload of BNG(optdigits)
100 runs, 0 likes, 0 downloads, 0 reach, 0 impact
1000000 instances - 65 features - 10 classes - 0 missing values