Data
Filter results by:
Mexico COVID-19 clinical data This dataset contains the results of real-time PCR testing for COVID-19 in Mexico as reported by the [General Directorate of…
0 runs0 likes0 downloads0 reach0 impact
263007 instances - 41 features - classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
714 runs0 likes4 downloads4 reach15 impact
303 instances - 14 features - 2 classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
737 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 2 classes - 6 missing values
If you reach this DATASET, please UPVOTE this dataset to show your appreciation DATASET DETAILS: 1.Previous Close: 264.29, 2.Open: 265.58, 3.Bid: 266.06 x 800, 4.Ask: 266.06 x 900, 5.Day's Range:…
0 runs0 likes0 downloads0 reach0 impact
9823 instances - 7 features - classes - 6 missing values
Overview This dataset contains historical daily prices for all tickers currently trading on ANTM.JK. since 29th Sept 2005 until 3rd Feb 2021. PT Aneka Tambang Tbk, colloquially known as Antam or…
0 runs0 likes0 downloads0 reach0 impact
3808 instances - 7 features - classes - 6 missing values
Context Apple has become a household name now a days as many people are using Apple products such as Iphone, Ipad, Apple watch etc. Apple recently became the only company to hit the 2 Trillion dollar…
0 runs0 likes0 downloads0 reach0 impact
10016 instances - 7 features - classes - 6 missing values
1. Title: Lung Cancer Data 2. Source Information: - Data was published in : Hong, Z.Q. and Yang, J.Y. "Optimal Discriminant Plane for a Small Number of Samples and Design Method of Classifier on the…
1238 runs0 likes0 downloads0 reach0 impact
32 instances - 57 features - 3 classes - 5 missing values
Content As of 2020, 6.7 million people reside in Victoria, Australia's second most populated state. Most of them, 5 million, live or work in Melbourne, state's capital. During 2020 Australia was among…
0 runs0 likes0 downloads0 reach0 impact
2106 instances - 14 features - classes - 4 missing values
This dataset has been scrapped off Goodreads to obtain land information about the best books of the 19th Century. Feature Description Book_Name the title of the book Author_Name the author(s) of the…
0 runs0 likes0 downloads0 reach0 impact
1101 instances - 5 features - classes - 4 missing values
Context Patients with Liver disease have been continuously increasing because of excessive consumption of alcohol, inhale of harmful gases, intake of contaminated food, pickles and drugs. This dataset…
0 runs0 likes0 downloads0 reach0 impact
583 instances - 11 features - 0 classes - 4 missing values
Data Set Information This data set contains 416 liver patient records and 167 non liver patient records.The data set was collected from test samples in North East of Andhra Pradesh, India.…
0 runs0 likes0 downloads0 reach0 impact
583 instances - 11 features - classes - 4 missing values
Context Hourly weather data for New York City. Extracted from online web sources. The following data set is cleaned for the purpose for NYC Taxi ETA calculation. Content We have features such as Date,…
0 runs0 likes0 downloads0 reach0 impact
5141 instances - 7 features - classes - 4 missing values
Context Patients with Liver disease have been continuously increasing because of excessive consumption of alcohol, inhale of harmful gases, intake of contaminated food, pickles and drugs. This dataset…
0 runs0 likes0 downloads0 reach0 impact
583 instances - 11 features - 0 classes - 4 missing values
Context The card payments data is published by the Reserve Bank of India on a monthly basis. The statistics cover the methods of payment used in retail transactions and ATM transactions in India. It…
0 runs0 likes0 downloads0 reach0 impact
5592 instances - 20 features - classes - 4 missing values
Description: This dataset contains information about real estate properties in Amsterdam. It includes details such as address, zip code, price, area size, number of rooms, longitude, and latitude…
0 runs0 likes0 downloads0 reach0 impact
924 instances - 8 features - classes - 4 missing values
Description: This dataset contains information about real estate properties in Amsterdam. It includes details such as address, zip code, price, area size, number of rooms, longitude, and latitude…
0 runs0 likes0 downloads0 reach0 impact
924 instances - 8 features - classes - 4 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
717 runs0 likes0 downloads0 reach0 impact
90 instances - 9 features - 2 classes - 3 missing values
Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs0 likes0 downloads0 reach0 impact
40 instances - 7 features - 0 classes - 3 missing values
Context Everybody nowadays is mindful of what they eat. Counting calories and reducing fat intake is the number one advice given by all dieticians and nutritionists. Therefore, we need to know what…
0 runs0 likes0 downloads0 reach0 impact
335 instances - 10 features - classes - 3 missing values
Context Dataset is generated through a long and complex process. Starting from scrapping the whole URLs provided on Genius.com for Game of Thrones series. Process on scrapping and cleaning the dataset…
0 runs0 likes0 downloads0 reach0 impact
23911 instances - 6 features - classes - 3 missing values
Context Towards Data Science Inc. is a corporation registered in Canada as a platform for thousands of people to exchange ideas and to expand understanding of data science. This dataset contains data…
0 runs0 likes0 downloads0 reach0 impact
46079 instances - 8 features - classes - 3 missing values
1. Title: Postoperative Patient Data 2. Source Information: -- Creators: Sharon Summers, School of Nursing, University of Kansas Medical Center, Kansas City, KS 66160 Linda Woolery, School of Nursing,…
1758 runs0 likes0 downloads0 reach0 impact
90 instances - 9 features - 3 classes - 3 missing values
See [https://github.com/slds-lmu/paper_2023_ci_for_ge](https://github.com/slds-lmu/paper_2023_ci_for_ge) for a description.
0 runs0 likes0 downloads0 reach0 impact
5100000 instances - 19 features - 0 classes - 3 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach0 impact
366 instances - 5 features - classes - 2 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Case number deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based learning…
10 runs0 likes0 downloads0 reach0 impact
195 instances - 11 features - 0 classes - 2 missing values
This data was collected from combine primary and secondary sources, through questionnaire, verbal interview and some part of the hospital’s record department’s data, from the selected…
0 runs0 likes0 downloads0 reach0 impact
281 instances - 98 features - 2 classes - 2 missing values
50 Danish words with their pronunciation from Dansk Ordbog
0 runs0 likes0 downloads0 reach0 impact
51 instances - 2 features - classes - 2 missing values
## **Meta-Album OmniPrint-MD-5-bis Dataset (Mini)** OmniPrint-MD-5-bis dataset consists of 28 240 images (128x128, RGB) from 706 categories. The images are synthesized with OmniPrint, and no further…
0 runs0 likes0 downloads0 reach1 impact
28240 instances - 39 features - 0 classes - 2 missing values
good
0 runs0 likes0 downloads0 reach0 impact
10 instances - 4 features - classes - 2 missing values
Context Based on 250K lyrics database. Created to perform Supervised NLP sentiment analysis task using Spotify valence audio feature, a measure of the positiveness of the song. Content Preparation of…
0 runs0 likes0 downloads0 reach0 impact
158353 instances - 5 features - classes - 2 missing values
These data are the results of a chemical analysis of wines grown in the same region in Italy but derived from three different cultivars. The analysis determined the quantities of 13 constituents found…
0 runs0 likes0 downloads0 reach0 impact
178 instances - 14 features - classes - 2 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
118 runs0 likes0 downloads0 reach0 impact
195 instances - 11 features - 2 classes - 2 missing values
Context This dataset is a playground for fundamental and technical analysis. This can serve as basic tutorial for time-series data analysis. Content Dataset consists of following files:…
0 runs0 likes0 downloads0 reach0 impact
744 instances - 13 features - classes - 2 missing values
Predicting forest cover ...
0 runs0 likes0 downloads0 reach0 impact
18182 instances - 14 features - 0 classes - 2 missing values
Predicting forest cover ...
0 runs0 likes0 downloads0 reach0 impact
18182 instances - 14 features - 0 classes - 2 missing values
The "Cookbook Reviews" is an extensive data set that includes a range of information about user interactions and recipe reviews. It contains important details like the recipe name, where it stands in…
0 runs0 likes0 downloads0 reach0 impact
18182 instances - 14 features - 0 classes - 2 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
32 runs0 likes0 downloads0 reach0 impact
57 instances - 11 features - 5 classes - 1 missing values
Context This dataset contains IMDb ratings and votes information for movies having original title. Useful for creating top rated movies recommender system. Content Descriptions of the columns: titleId…
0 runs0 likes0 downloads0 reach0 impact
67408 instances - 5 features - classes - 1 missing values
Context This Dataset, consist of Tweets regarding Machine Learning, posted by Prof.Andrew Ng, Co-founder of Coursera, and an Adjunct Professor of Computer Science at Stanford University. His machine…
0 runs0 likes0 downloads0 reach0 impact
1234 instances - 8 features - classes - 1 missing values
Chocolate Bar Ratings. Expert ratings of over 1,700 chocolate bars. Each chocolate is evaluated from a combination of both objective qualities and subjective interpretation. A rating here only…
0 runs0 likes0 downloads0 reach0 impact
1795 instances - 9 features - 42 classes - 1 missing values
dgf_test
0 runs0 likes0 downloads0 reach0 impact
3415 instances - 5 features - 2 classes - 1 missing values
dgf_test
0 runs0 likes0 downloads0 reach0 impact
3415 instances - 5 features - 2 classes - 1 missing values
Content This data is an extract from a bigger reddit dataset (All reddit comments from May 2019, 157Gb or data uncompressed) that contains both more comments and more associated informations…
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 4 features - classes - 1 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
113 runs0 likes0 downloads0 reach0 impact
366 instances - 5 features - 2 classes - 1 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
104 runs0 likes0 downloads0 reach0 impact
57 instances - 11 features - 2 classes - 1 missing values
Context It's the preprocessed train data from Quora Insincere Questions competition 2018 The original train data is preprocessed to remove stop words, numbers, punctuations, common words and converted…
0 runs0 likes0 downloads0 reach0 impact
1306122 instances - 4 features - classes - 1 missing values
What is World of Wacraft? According to Wikipedia: World of Warcraft (WoW) is a massively multiplayer online role-playing game (MMORPG) released in 2004 by Blizzard Entertainment. It is the fourth…
0 runs0 likes0 downloads0 reach0 impact
478 instances - 6 features - classes - 1 missing values
I have created this dataset to showcase the use of predictive modeling using the stock market as a case study. This dataset is designed to help and predict tomorrow's Amazon stock price. If you want…
0 runs0 likes0 downloads0 reach0 impact
350 instances - 8 features - classes - 1 missing values
Upstream data from the twin Archimedes screw hydro-electric generator on the river Thames at Caversham weir, Reading, UK.
0 runs0 likes0 downloads0 reach0 impact
235 instances - 2 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10402, and it has 65 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
65 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30047, and it has 97 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
97 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11054, and it has 344 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
344 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11499, and it has 12 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
12 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11286, and it has 297 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
297 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17024, and it has 373 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
373 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 279, and it has 126 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
126 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100848, and it has 60 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
60 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100869, and it has 18 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
18 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10541, and it has 151 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
151 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17075, and it has 15 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
15 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101309, and it has 73 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
73 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12950, and it has 34 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
34 instances - 1026 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
336 instances - 15 features - classes - 0 missing values
We use the following representation to collect the dataset age - age bp - blood pressure sg - specific gravity al - albumin su - sugar rbc - red blood cells pc - pus cell pcc - pus cell clumps ba -…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 28 features - classes - 0 missing values
No data.
222 runs0 likes0 downloads0 reach0 impact
1504 instances - 2887 features - 13 classes - 0 missing values
No data.
428 runs0 likes0 downloads0 reach0 impact
1003 instances - 3183 features - 10 classes - 0 missing values
No data.
268 runs0 likes0 downloads0 reach0 impact
3075 instances - 12433 features - 6 classes - 0 missing values
No data.
373 runs0 likes0 downloads0 reach0 impact
918 instances - 3013 features - 10 classes - 0 missing values
No data.
159 runs0 likes0 downloads0 reach0 impact
1657 instances - 3759 features - 25 classes - 0 missing values
No data.
264 runs0 likes0 downloads0 reach0 impact
3204 instances - 13196 features - 6 classes - 0 missing values
No data.
211 runs0 likes0 downloads0 reach0 impact
313 instances - 5805 features - 8 classes - 0 missing values
No data.
163 runs0 likes0 downloads0 reach0 impact
1560 instances - 8461 features - 20 classes - 0 missing values
No data.
216 runs0 likes0 downloads0 reach0 impact
11162 instances - 11466 features - 10 classes - 0 missing values
No data.
203 runs0 likes0 downloads0 reach0 impact
878 instances - 7455 features - 10 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach0 impact
6 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach0 impact
34 instances - 1143 features - 0 classes - 0 missing values
This is one of 41 drug design datasets. The datasets with 1143 features are formed using Adriana.Code software (www.molecular-networks.com/software/adrianacode). The molecules and outputs are taken…
0 runs0 likes0 downloads0 reach0 impact
15 instances - 10 features - 0 classes - 0 missing values
No data.
414 runs0 likes0 downloads0 reach0 impact
690 instances - 8262 features - 10 classes - 0 missing values
No data.
220 runs0 likes0 downloads0 reach0 impact
336 instances - 7903 features - 6 classes - 0 missing values
No data.
108 runs0 likes0 downloads0 reach0 impact
927 instances - 10129 features - 7 classes - 0 missing values
No data.
377 runs0 likes0 downloads0 reach0 impact
913 instances - 3101 features - 10 classes - 0 missing values
No data.
219 runs0 likes0 downloads0 reach0 impact
414 instances - 6430 features - 9 classes - 0 missing values
No data.
215 runs0 likes0 downloads0 reach0 impact
204 instances - 5833 features - 6 classes - 0 missing values
No data.
426 runs0 likes0 downloads0 reach0 impact
2463 instances - 2001 features - 17 classes - 0 missing values
No data.
67 runs0 likes0 downloads0 reach0 impact
9558 instances - 26833 features - 44 classes - 0 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
2 runs0 likes0 downloads0 reach0 impact
108 instances - 4 features - 0 classes - 0 missing values
Data on the homicide rate in Detroit for the years 1961-1973. This is the data set called DETROIT in the book 'Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series…
0 runs0 likes0 downloads0 reach0 impact
13 instances - 14 features - 0 classes - 0 missing values
Data on the recurrence times to infection, at the point of insertion of the catheter, for kidney patients using portable dialysis equipment. Catheters may be removed for reasons other than infection,…
2 runs0 likes0 downloads0 reach0 impact
76 instances - 7 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
450 instances - 4 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
475 instances - 4 features - 0 classes - 0 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
2 runs0 likes0 downloads0 reach0 impact
475 instances - 4 features - 0 classes - 0 missing values
A family of datasets synthetically generated from a simulation of how bank-customers choose their banks. Tasks are based on predicting the fraction of bank customers who leave the bank because of full…
0 runs0 likes0 downloads0 reach0 impact
8192 instances - 33 features - 0 classes - 0 missing values
The Boston house-price data of Harrison, D. and Rubinfeld, D.L. 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol.5, 81-102, 1978. Used in Belsley, Kuh & Welsch,…
9 runs0 likes0 downloads0 reach0 impact
506 instances - 14 features - 0 classes - 0 missing values
Information about the dataset CLASSTYPE: numeric CLASSINDEX: last
2 runs0 likes0 downloads0 reach0 impact
559 instances - 5 features - 0 classes - 0 missing values
Determinants of Wages from the 1985 Current Population Survey Summary: The Current Population Survey (CPS) is used to supplement census information between census years. These data consist of a random…
2 runs0 likes0 downloads0 reach0 impact
534 instances - 11 features - 0 classes - 0 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
2 runs0 likes0 downloads0 reach0 impact
100 instances - 4 features - 0 classes - 0 missing values
Information about the dataset CLASSTYPE: numeric CLASSINDEX: last
2 runs0 likes0 downloads0 reach0 impact
559 instances - 5 features - 0 classes - 0 missing values
If you use an algorithm, dataset, or other information from StatLib, please acknowledge both StatLib and the original contributor of the material. Pace and Barry (1997), "Sparse Spatial…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 9 features - 0 classes - 0 missing values
This file contains the data in "The MU284 Population" from Appendix B of the book "Model Assisted Survey Sampling" by Sarndal, Swensson and Wretman, published by Springer-Verlag, New York, 1992. The…
0 runs0 likes0 downloads0 reach0 impact
284 instances - 10 features - 0 classes - 0 missing values
17x17x2x2 tables of counts in GLIM-ready format used for the analyses in Biblarz, Timothy J., and Adrian E. Raftery. 1993. "The Effects of Family Disruption on Social Mobility." American Sociological…
6 runs0 likes0 downloads0 reach0 impact
1156 instances - 6 features - 0 classes - 0 missing values