OpenML
Filter results by:
dataset for bme
0 runs0 likes0 downloads0 reach0 impact
63 instances - 12 features - classes - 52 missing values
1. Title: INDUCE Trains Data set 2. Sources: - Donor: GMU, Center for AI, Software Librarian, Eric E. Bloedorn (bloedorn@aic.gmu.edu) - Original owners: Ryszard S. Michalski (michalski@aic.gmu.edu)…
1973 runs0 likes0 downloads0 reach0 impact
10 instances - 33 features - 2 classes - 51 missing values
This is the Car sales data set which include information about different cars . This data set is being taken from the Analytixlabs for the purpose of prediction In this we have to see two things First…
0 runs0 likes0 downloads0 reach0 impact
157 instances - 16 features - classes - 51 missing values
This a blend dataset that contains historic Swedish interest rates from 1908-2001 Source/Klla: Sveriges riksbank and Swedish inflation rate 1908-2001 fetched from Sweden's statistic central bureau…
0 runs0 likes0 downloads0 reach0 impact
109 instances - 4 features - classes - 49 missing values
Estimated article influence scores in 2015
0 runs0 likes0 downloads0 reach0 impact
3615 instances - 7 features - 3169 classes - 48 missing values
Context This dataset was collected by Neha Prerna Tigga and Dr. Shruti Garg of the Department of Computer Science and Engineering, BIT Mesra, Ranchi-835215 for research, non-commercial purposes only.…
0 runs0 likes0 downloads0 reach0 impact
952 instances - 18 features - classes - 48 missing values
ContextThisdatasetwascollectedbyNehaPrernaTiggaandDrShrutiGargoftheDepartmentofComputerScienceandEngineeringBITMesraRanchi835215forresearchnoncommercialpurposesonlyAnarticleisalsopublishedimplementingthisdatasetFormoreinformationandcitationofthisdatasetpleasereferTiggaNPGargS2020PredictionofType2DiabetesusingMachineLearningClassificationMethodsProcediaComputerScience167706716DOIhttpsdoiorg101016jprocs202003336ContentThereisatotalof952instanceswith17independentpredictorvariablesandonebinarytargetordependentvariableDiabetesAcknowledgementsWewouldliketothankalltheparticipantswhocontributedtowardsthebuildingofthisdatasetInspirationTobuildamachinelearningalgorithmtopredictifapersonhasdiabetesornot…
0 runs0 likes0 downloads0 reach0 impact
952 instances - 18 features - classes - 48 missing values
Conventional and Social Media Movies (CSM) - Dataset 2014 and 2015 Data Set 12 features categorized as conventional and social media features. Both conventional features, collected from movies…
0 runs0 likes0 downloads0 reach0 impact
231 instances - 13 features - classes - 46 missing values
Context People love movies because: It takes you on a journey. Its an escape from reality. Being a vivid movie watcher I always get amazed how sites like Netflix and Hotstar always exactly suggest the…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 6 features - classes - 44 missing values
Context People love movies because: It takes you on a journey. Its an escape from reality. Being a vivid movie watcher I always get amazed how sites like Netflix and Hotstar always exactly suggest the…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 6 features - classes - 44 missing values
The dataset contains the serie a matches for season 2015-2016
0 runs0 likes0 downloads0 reach0 impact
379 instances - 38 features - classes - 44 missing values
Context The goal of this dataset is to provide a tidy way to access to the transcripts of speeches given by various US politicians in the context of the 2020 US Presidential Election. Transcripts have…
0 runs0 likes0 downloads0 reach0 impact
269 instances - 6 features - classes - 42 missing values
Context The dataset provides user reviews on specific drugs along with related conditions, side effects, age, sex, and ratings reflecting overall patient satisfaction. Content Data was acquired by…
0 runs0 likes0 downloads0 reach0 impact
362806 instances - 11 features - classes - 42 missing values
Squash Harvest Unstored Data source: Winna Harvey Crop and Food Research, Christchurch, New Zealand The purpose of the research was to determine the changes taking place in squash fruit during the…
876 runs0 likes0 downloads0 reach0 impact
52 instances - 24 features - 3 classes - 39 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
687 runs0 likes0 downloads0 reach0 impact
52 instances - 24 features - 2 classes - 39 missing values
This dataset contains 3 more features compared to version 1 of the same dataset. Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by…
0 runs0 likes0 downloads0 reach0 impact
62 instances - 10 features - 0 classes - 38 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
485 runs0 likes0 downloads0 reach0 impact
76 instances - 15 features - 7 classes - 37 missing values
The coronavirus pandemic has affected the entire world and many families have been destroyed. The stock exchange was also affected, but vaccine companies took advantage of this moment and leveraged…
0 runs0 likes0 downloads0 reach0 impact
255 instances - 36 features - classes - 35 missing values
See [https://github.com/slds-lmu/paper_2023_ci_for_ge](https://github.com/slds-lmu/paper_2023_ci_for_ge) for a description.
0 runs0 likes0 downloads0 reach0 impact
5100000 instances - 10 features - 0 classes - 34 missing values
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% This is a PROMISE Software Engineering Repository data set made publicly available in order to encourage repeatable,…
0 runs0 likes0 downloads0 reach0 impact
31 instances - 31 features - 0 classes - 33 missing values
Context It's always interesting to analyse what's going on in the news but there's no good data avaialable in indian context on kaggle, so I created one. Content Apart from the main content of the…
0 runs0 likes0 downloads0 reach0 impact
1568 instances - 8 features - classes - 33 missing values
Data on educational transitions for a sample of 500 Irish schoolchildren aged 11 in 1967. The data were collected by Greaney and Kelleghan (1984), and reanalyzed by Raftery and Hout (1985, 1993). ###…
16174 runs0 likes0 downloads0 reach0 impact
500 instances - 6 features - 2 classes - 32 missing values
Context The data set contains laboratory values of blood donors and Hepatitis C patients and demographic values like age. The data was obtained from UCI Machine Learning Repository:…
0 runs0 likes0 downloads0 reach0 impact
615 instances - 14 features - classes - 31 missing values
The data set contains laboratory values of blood donors and Hepatitis C patients and demographic values like age.The target attribute for classification is Category (blood donors vs. Hepatitis C…
0 runs0 likes0 downloads0 reach0 impact
615 instances - 14 features - classes - 31 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach0 impact
365 instances - 4 features - 0 classes - 30 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
708 runs0 likes0 downloads0 reach0 impact
365 instances - 4 features - 2 classes - 30 missing values
Introduction TMDB.org is a crowd-sourced movie information database used by many film-related consoles, sites and apps, such as XBMC, MythTV and Plex. Dozens of media managers, mobile apps and social…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 6 features - classes - 30 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
537 runs0 likes0 downloads0 reach0 impact
285 instances - 8 features - 7 classes - 27 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
722 runs0 likes0 downloads0 reach0 impact
285 instances - 8 features - 2 classes - 27 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
53 runs0 likes0 downloads0 reach0 impact
92 instances - 6 features - 0 classes - 26 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
736 runs0 likes0 downloads0 reach0 impact
92 instances - 6 features - 2 classes - 26 missing values
# Space Shuttle Autolanding Domain NASA: Mr. Roger Burke's autolander design team ##### Past Usage: (several, it appears) Example: Michie,D. (1988). The Fifth Generation's Unbridged Gap. In Rolf…
1466 runs0 likes0 downloads0 reach0 impact
15 instances - 7 features - 2 classes - 26 missing values
This is a PROMISE data set made publicly available in order to encourage repeatable, verifiable, refutable, and/or improvable predictive models of software engineering. If you publish material based…
21918 runs0 likes0 downloads0 reach0 impact
10885 instances - 22 features - 2 classes - 25 missing values
Context This Dataset contains the value of the Bitcoin stock from 14th September 2014 till Date Content It is a very simple dataset to both explore and understand the columns are themselves…
0 runs0 likes0 downloads0 reach0 impact
2401 instances - 7 features - classes - 24 missing values
The data comprises the brand, type and model. For each car we have information about the cubic capacity (in cm3), the maximal power of engine (in kW), the maximal torque (in Nm), the number of seats,…
0 runs0 likes0 downloads0 reach0 impact
475 instances - 13 features - classes - 24 missing values
Context Dogecoin is making news as well as a little profit these days although it may seem like its new in the market but it has been around quite a while now. I have tried to collect historical data…
0 runs0 likes0 downloads0 reach0 impact
1131 instances - 7 features - classes - 24 missing values
Yu-Gi-Oh! is a Japanese manga series about gaming written and illustrated by Kazuki Takahashi. It was serialized in Shueisha's Weekly Shnen Jump magazine between September 30, 1996 and March 8, 2004.…
0 runs0 likes0 downloads0 reach0 impact
156 instances - 11 features - classes - 23 missing values
Context This Data set Contains the values of Stock of Amazon which dates between 15 July ,2019 to 15 July,2020 Content The Data set contains 7 different columns that includes Date, The opening value…
0 runs0 likes0 downloads0 reach0 impact
250 instances - 8 features - classes - 23 missing values
Arbres urbains
0 runs0 likes0 downloads0 reach0 impact
2 instances - 57 features - 1 classes - 22 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
106 runs0 likes0 downloads0 reach0 impact
76 instances - 45 features - 2 classes - 22 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
41 runs0 likes0 downloads0 reach0 impact
1340 instances - 17 features - 3 classes - 20 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach0 impact
228 instances - 8 features - classes - 20 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
131 runs0 likes0 downloads0 reach0 impact
1340 instances - 17 features - 2 classes - 20 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
118 runs0 likes0 downloads0 reach0 impact
228 instances - 9 features - 2 classes - 20 missing values
Context Hi, The original Dataset wad published by Frankcc in the following link: Link Kaggle The dataset is involved into the analysis of depression. The data was consists as a study about the life…
0 runs0 likes0 downloads0 reach0 impact
1429 instances - 23 features - classes - 20 missing values
Context "In the Bechdel Cast, the question asked, do movies have women in them? Are all their discussions involve boyfriends or husbands or do they have individualism? The Patriarchy's vast. Let's…
0 runs0 likes0 downloads0 reach0 impact
194 instances - 14 features - classes - 20 missing values
Database of baseball players and play statistics, including 'Games_played', 'At_bats', 'Runs', 'Hits', 'Doubles', 'Triples', 'Home_runs', 'RBIs', 'Walks', 'Strikeouts', 'Batting_average',…
795 runs0 likes0 downloads0 reach0 impact
1340 instances - 17 features - 3 classes - 20 missing values
Palmer Penguins Dataset The goal of palmerpenguins is to provide a great dataset for data exploration visualization, as an alternative to iris. About the data Data were collected and made available by…
0 runs0 likes0 downloads0 reach0 impact
344 instances - 7 features - classes - 19 missing values
![palmerpenguins](https://github.com/allisonhorst/palmerpenguins/raw/master/man/figures/logo.png) ## Description The goal of palmerpenguins is to provide a great dataset for data exploration &…
0 runs0 likes0 downloads0 reach0 impact
344 instances - 7 features - 3 classes - 18 missing values
#modelage
31 runs0 likes0 downloads0 reach0 impact
202 instances - 20 features - 2 classes - 17 missing values
Twenty two observations of the Dwarf planet Ceres as observed by Giueseppe Piazzi and published in the September edition of Monatlicher Correspondenz in 1801. These were the measurements used by Gauss…
0 runs0 likes0 downloads0 reach0 impact
22 instances - 9 features - classes - 17 missing values
Current dataset was adapted to ARFF format from the UCI version. Sample code ID's were removed. ! Note that there is also a related Breast Cancer Wisconsin (Diagnosis) Data Set with a different set of…
28732 runs0 likes0 downloads0 reach0 impact
699 instances - 10 features - 2 classes - 16 missing values
Context Cytology features of breast cancer biopsy. It can be used to predict breast cancer from cytology features. The data was obtained from…
0 runs0 likes0 downloads0 reach0 impact
699 instances - 11 features - classes - 16 missing values
Context So it's Halloween again dear Kagglers! And what better way of celebrating than with some NLP! The dataset brings you the reviews of popular Halloween costumes sold on amazon as of November…
0 runs0 likes0 downloads0 reach0 impact
7814 instances - 5 features - classes - 16 missing values
February 23, 1982 The 1982 annual meetings of the American Statistical Association (ASA) will be held August 16-19, 1982 in Cincinnati. At that meeting, the ASA Committee on Statistical Graphics plans…
759 runs0 likes0 downloads0 reach0 impact
209 instances - 9 features - 2 classes - 15 missing values
One of the primary challenges in identifying the risks of the Burst Header Packet (BHP) flood attacks in Optical Burst Switching networks (OBS) is the scarcity of reliable historical data. ###…
0 runs0 likes0 downloads0 reach0 impact
1075 instances - 22 features - classes - 15 missing values
The Committee on Statistical Graphics of the American Statistical Association (ASA) invites you to participate in its Second (1983) Exposition of Statistical Graphics Technology. The purposes of the…
164 runs0 likes0 downloads0 reach0 impact
406 instances - 8 features - 3 classes - 14 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach0 impact
60 instances - 11 features - 0 classes - 14 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
683 runs0 likes0 downloads0 reach0 impact
60 instances - 11 features - 2 classes - 14 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
718 runs0 likes0 downloads0 reach0 impact
406 instances - 9 features - 2 classes - 14 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Attributes 2,4, and 6 deleted. Midrange price treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M.…
0 runs0 likes0 downloads0 reach0 impact
93 instances - 23 features - 0 classes - 14 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
730 runs0 likes0 downloads0 reach0 impact
93 instances - 23 features - 2 classes - 14 missing values
Context This Data set Contains the values of Stock of Amazon which dates between 1st October,2019 to 14 July,2020 Content The Data set contains 7 different columns that includes Date, The opening…
0 runs0 likes0 downloads0 reach0 impact
202 instances - 8 features - classes - 13 missing values
Context This dataset provides a list of lyrics from 1950 to 2019 describing music metadata as sadness, danceability, loudness, acousticness, etc. Authors also provide some information as lyrics which…
0 runs0 likes0 downloads0 reach0 impact
28372 instances - 31 features - classes - 13 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) Data from which conclusions were drawn in the article "Sleep in Mammals: Ecological and Constitutional Correlates" by Allison, T. and Cicchetti, D.…
0 runs0 likes0 downloads0 reach0 impact
62 instances - 8 features - 0 classes - 12 missing values
Context "Predict behavior to retain customers. You can analyze all relevant customer data and develop focused customer retention programs." [IBM Sample Data Sets] Content Each row represents a…
0 runs0 likes0 downloads0 reach0 impact
7043 instances - 20 features - 2 classes - 11 missing values
Arbres urbains
0 runs0 likes0 downloads0 reach0 impact
1 instances - 57 features - 1 classes - 11 missing values
Context Retail dataset of a global superstore for 4 years. Perform EDA and Predict the sales of the next 7 days from the last date of the Training dataset! Content Time series analysis deals with time…
0 runs0 likes0 downloads0 reach0 impact
9800 instances - 18 features - classes - 11 missing values
Content The Dataset contains the summary of each player. on the basis of the summary of each player you have to predict the target variable Target Variable: 1-Signifies whether a player has a career…
0 runs0 likes0 downloads0 reach0 impact
1340 instances - 21 features - 0 classes - 11 missing values
Context There's a story behind every dataset and here's your opportunity to share yours. Content What's inside is more than just rows and columns. Make it easy for others to get started by describing…
0 runs0 likes0 downloads0 reach0 impact
205 instances - 23 features - classes - 10 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
50 runs0 likes0 downloads0 reach0 impact
95 instances - 8 features - 5 classes - 9 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
0 runs0 likes0 downloads0 reach0 impact
163 instances - 27 features - 5 classes - 9 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Tumor-size treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
0 runs0 likes0 downloads0 reach0 impact
286 instances - 10 features - 0 classes - 9 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
708 runs0 likes0 downloads0 reach0 impact
286 instances - 10 features - 2 classes - 9 missing values
Citation Request: This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
2009 runs0 likes0 downloads0 reach0 impact
286 instances - 10 features - 2 classes - 9 missing values
### Attribute Information * The first column is the class label (1 for signal, 0 for background) * 21 low-level features (kinematic properties): lepton pT, lepton eta, lepton phi, missing energy…
14393 runs0 likes0 downloads0 reach0 impact
98050 instances - 29 features - 2 classes - 9 missing values
Data Set Information: The data has been produced using Monte Carlo simulations. The first 21 features (columns 2-22) are kinematic properties measured by the particle detectors in the accelerator. The…
0 runs0 likes0 downloads0 reach0 impact
98050 instances - 29 features - 0 classes - 9 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
119 runs0 likes0 downloads0 reach0 impact
95 instances - 9 features - 2 classes - 9 missing values
The dataset contains the premier league matches for the season 2014-2015.
0 runs0 likes0 downloads0 reach0 impact
380 instances - 38 features - classes - 9 missing values
n
0 runs0 likes0 downloads0 reach0 impact
5372 instances - 11 features - classes - 9 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
723 runs0 likes0 downloads0 reach0 impact
366 instances - 35 features - 2 classes - 8 missing values
Context The World Health Organization (WHO) declared the 201920 coronavirus outbreak a pandemic and a Public Health Emergency of International Concern (PHEIC). Evidence of local transmission of the…
0 runs0 likes0 downloads0 reach0 impact
1010 instances - 7 features - classes - 8 missing values
Context While brainstorming ideas for a statistics project for a course last semester, the idea of utilizing data about microbreweries came up. Unfortunately after some exploration and thought, we…
0 runs0 likes0 downloads0 reach0 impact
2407 instances - 6 features - classes - 8 missing values
ContextWhilebrainstormingideasforastatisticsprojectforacourselastsemestertheideaofutilizingdataaboutmicrobreweriescameupUnfortunatelyaftersomeexplorationandthoughtwescrappedtheideabutIvegotthisdatatoshowforitanywayContentDatawasacquiredfromhttpswwwbeermonthclubcomandcontainsinformationaboutbreweriesavailableonthesiteasofOctober1st2019NametypeofbreweryaddresswebsiteandstateareamongthedataincludedCodeusedtoscrapeisavailableathttpsgithubcombrkurzawabrewpubscraperAcknowledgementsThankstobeermonthclubforallowingthebrewpubsectiontobescrapedInspirationThisdataaloneisnttooinspiringbutmaybeitcouldbeusedasthebasisforananalysisprojectthatlooksatinformationsuchasyelpreviewsornumberoflinkstoabreweryswebsite…
0 runs0 likes0 downloads0 reach0 impact
2407 instances - 6 features - classes - 8 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
670 runs0 likes0 downloads0 reach0 impact
62 instances - 8 features - 2 classes - 8 missing values
1. Title: Dermatology Database 2. Source Information: (a) Original owners: -- 1. Nilsel Ilter, M.D., Ph.D., Gazi University, School of Medicine 06510 Ankara, Turkey Phone: +90 (312) 214 1080 -- 2. H.…
1756 runs0 likes0 downloads0 reach0 impact
366 instances - 35 features - 6 classes - 8 missing values
Context While brainstorming ideas for a statistics project for a course last semester, the idea of utilizing data about microbreweries came up. Unfortunately after some exploration and thought, we…
0 runs0 likes0 downloads0 reach0 impact
2407 instances - 6 features - classes - 8 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
707 runs0 likes0 downloads0 reach0 impact
52 instances - 25 features - 2 classes - 7 missing values
Squash Harvest Stored Data source: Winna Harvey Crop and Food Research, Christchurch, New Zealand The purpose of the research was to determine the changes taking place in squash fruit during the…
867 runs0 likes0 downloads0 reach0 impact
52 instances - 25 features - 3 classes - 7 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1763 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 2 classes - 7 missing values
ThedatahasbeenreleasedbySDSNandextractedbyPromptCloudscustomwebcrawlingsolutionContextTheWorldHappinessReportisalandmarksurveyofthestateofglobalhappinessthatranks156countriesbyhowhappytheircitizensperceivethemselvestobeThisyearsWorldHappinessReportfocusesonhappinessandthecommunityhowhappinesshasevolvedoverthepastdozenyearswithafocusonthetechnologiessocialnormsconflictsandgovernmentpoliciesthathavedriventhosechangesContentWhatisDystopiaDystopiaisanimaginarycountrythathastheworldsleasthappypeopleThepurposeinestablishingDystopiaistohaveabenchmarkagainstwhichallcountriescanbefavorablycomparednocountryperformsmorepoorlythanDystopiaintermsofeachofthesixkeyvariablesthusallowingeachsubbartobeofpositiveorzeroinsixinstanceswidthThelowestscoresobservedforthesixkeyvariablesthereforecharacterizeDystopiaSincelifewouldbeveryunpleasantinacountrywiththeworldslowestincomeslowestlifeexpectancylowestgenerositymostcorruptionleastfreedomandleastsocialsupportitisreferredtoasDystopiaincontrasttoUtopiaWhataretheresidualsTheresidualsorunexplainedcomponentsdifferforeachcountryreflectingtheextenttowhichthesixvariableseitheroverorunderexplainaverage20162018lifeevaluationsTheseresidualshaveanaveragevalueofapproximatelyzerooverthewholesetofcountriesFigure27showstheaverageresidualforeachcountryiftheequationinTable21isappliedtoaverage20162018dataforthesixvariablesinthatcountryWecombinetheseresidualswiththeestimateforlifeevaluationsinDystopiasothatthecombinedbarwillalwayshavepositivevaluesAscanbeseeninFigure27althoughsomelifeevaluationresidualsarequitelargeoccasionallyexceedingonepointonthescalefrom0to10theyarealwaysmuchsmallerthanthecalculatedvalueinDystopiawheretheaveragelifeisratedat188onthe0to10scaleTable7oftheonlineStatisticalAppendix1forChapter2putstheDystopiaplusresidualblockattheleftsideandalsodrawstheDystopialinemakingiteasytocomparethesignsandsizesoftheresidualsindifferentcountriesWhydoweusethesesixfactorstoexplainlifeevaluationsThevariablesusedreflectwhathasbeenbroadlyfoundintheresearchliteraturetobeimportantinexplainingnationalleveldifferencesinlifeevaluationsSomeimportantvariablessuchasunemploymentorinequalitydonotappearbecausecomparableinternationaldataarenotyetavailableforthefullsampleofcountriesThevariablesareintendedtoillustrateimportantlinesofcorrelationratherthantoreflectcleancausalestimatessincesomeofthedataaredrawnfromthesamesurveysourcessomearecorrelatedwitheachotherorwithotherimportantfactorsforwhichwedonothavemeasuresandinseveralinstancestherearelikelytobetwowayrelationsbetweenlifeevaluationsandthechosenvariablesforexamplehealthypeopleareoverallhappierbutasChapter4intheWorldHappinessReport2013demonstratedhappierpeopleareoverallhealthierInStatisticalAppendix1ofWorldHappinessReport2018weassessedthepossibleimportanceofusingexplanatorydatafromthesamepeoplewhoselifeevaluationsarebeingexplainedWedidthisbyrandomlydividingthesamplesintotwogroupsandusingtheaveragevaluesforegfreedomgleanedfromonegrouptoexplainthelifeevaluationsoftheothergroupThisloweredtheeffectsbutonlyveryslightlyeg2to3assuringusthatusingdatafromthesameindividualsisnotseriouslyaffectingtheresultsDatasourcehttpworldhappinessreported2019MoresuchdatasetscanbedownloadedfromDataStock…
0 runs0 likes0 downloads0 reach0 impact
1038 instances - 13 features - classes - 7 missing values
Context Some camera enthusiast went and described 1,000 cameras based on 13 properties! Content Row one describes the datatype for each column and can probably be removed. The 13 properties of each…
0 runs0 likes0 downloads0 reach0 impact
1038 instances - 13 features - 0 classes - 7 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
717 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 2 classes - 7 missing values
One of the data sets used in the book "Analyzing Categorical Data" by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. Further details concerning the book, including information on statistical…
0 runs0 likes0 downloads0 reach0 impact
30 instances - 7 features - 0 classes - 6 missing values
analcatdata A collection of data sets used in the book "Analyzing Categorical Data," by Jeffrey S. Simonoff, Springer-Verlag, New York, 2003. The submission consists of a zip file containing two…
18 runs0 likes0 downloads0 reach0 impact
159 instances - 10 features - 0 classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
705 runs0 likes0 downloads0 reach0 impact
398 instances - 8 features - 2 classes - 6 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
720 runs0 likes0 downloads0 reach0 impact
159 instances - 10 features - 2 classes - 6 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Cholesterol treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
175 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 0 classes - 6 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Identifier attribute deleted. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using instance-based…
2 runs0 likes0 downloads0 reach0 impact
398 instances - 8 features - 0 classes - 6 missing values
Donor: David W. Aha (aha@ics.uci.edu) This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one…
48 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 0 classes - 6 missing values