OpenML
Filter results by:
ThedatahasbeenreleasedbySDSNandextractedbyPromptCloudscustomwebcrawlingsolutionContextTheWorldHappinessReportisalandmarksurveyofthestateofglobalhappinessthatranks156countriesbyhowhappytheircitizensperceivethemselvestobeThisyearsWorldHappinessReportfocusesonhappinessandthecommunityhowhappinesshasevolvedoverthepastdozenyearswithafocusonthetechnologiessocialnormsconflictsandgovernmentpoliciesthathavedriventhosechangesContentWhatisDystopiaDystopiaisanimaginarycountrythathastheworldsleasthappypeopleThepurposeinestablishingDystopiaistohaveabenchmarkagainstwhichallcountriescanbefavorablycomparednocountryperformsmorepoorlythanDystopiaintermsofeachofthesixkeyvariablesthusallowingeachsubbartobeofpositiveorzeroinsixinstanceswidthThelowestscoresobservedforthesixkeyvariablesthereforecharacterizeDystopiaSincelifewouldbeveryunpleasantinacountrywiththeworldslowestincomeslowestlifeexpectancylowestgenerositymostcorruptionleastfreedomandleastsocialsupportitisreferredtoasDystopiaincontrasttoUtopiaWhataretheresidualsTheresidualsorunexplainedcomponentsdifferforeachcountryreflectingtheextenttowhichthesixvariableseitheroverorunderexplainaverage20162018lifeevaluationsTheseresidualshaveanaveragevalueofapproximatelyzerooverthewholesetofcountriesFigure27showstheaverageresidualforeachcountryiftheequationinTable21isappliedtoaverage20162018dataforthesixvariablesinthatcountryWecombinetheseresidualswiththeestimateforlifeevaluationsinDystopiasothatthecombinedbarwillalwayshavepositivevaluesAscanbeseeninFigure27althoughsomelifeevaluationresidualsarequitelargeoccasionallyexceedingonepointonthescalefrom0to10theyarealwaysmuchsmallerthanthecalculatedvalueinDystopiawheretheaveragelifeisratedat188onthe0to10scaleTable7oftheonlineStatisticalAppendix1forChapter2putstheDystopiaplusresidualblockattheleftsideandalsodrawstheDystopialinemakingiteasytocomparethesignsandsizesoftheresidualsindifferentcountriesWhydoweusethesesixfactorstoexplainlifeevaluationsThevariablesusedreflectwhathasbeenbroadlyfoundintheresearchliteraturetobeimportantinexplainingnationalleveldifferencesinlifeevaluationsSomeimportantvariablessuchasunemploymentorinequalitydonotappearbecausecomparableinternationaldataarenotyetavailableforthefullsampleofcountriesThevariablesareintendedtoillustrateimportantlinesofcorrelationratherthantoreflectcleancausalestimatessincesomeofthedataaredrawnfromthesamesurveysourcessomearecorrelatedwitheachotherorwithotherimportantfactorsforwhichwedonothavemeasuresandinseveralinstancestherearelikelytobetwowayrelationsbetweenlifeevaluationsandthechosenvariablesforexamplehealthypeopleareoverallhappierbutasChapter4intheWorldHappinessReport2013demonstratedhappierpeopleareoverallhealthierInStatisticalAppendix1ofWorldHappinessReport2018weassessedthepossibleimportanceofusingexplanatorydatafromthesamepeoplewhoselifeevaluationsarebeingexplainedWedidthisbyrandomlydividingthesamplesintotwogroupsandusingtheaveragevaluesforegfreedomgleanedfromonegrouptoexplainthelifeevaluationsoftheothergroupThisloweredtheeffectsbutonlyveryslightlyeg2to3assuringusthatusingdatafromthesameindividualsisnotseriouslyaffectingtheresultsDatasourcehttpworldhappinessreported2019MoresuchdatasetscanbedownloadedfromDataStock…
0 runs0 likes0 downloads0 reach0 impact
1038 instances - 13 features - classes - 7 missing values
Cryptocurrencies Cryptocurrencies are fast becoming rivals to traditional currency across the world. The digital currencies are available to purchase in many different places, making it accessible to…
0 runs0 likes0 downloads0 reach0 impact
632218 instances - 9 features - classes - 69712 missing values
Context The consumer credit department of a bank wants to automate the decisionmaking process for approval of home equity lines of credit. To do this, they will follow the recommendations of the Equal…
0 runs0 likes0 downloads0 reach0 impact
5960 instances - 13 features - classes - 5271 missing values
The dataset was created by Angeliki Xifara (angxifara '@' gmail.com, Civil/Structural Engineer) and was processed by Athanasios Tsanas (tsanasthanasis '@' gmail.com, Oxford Centre for Industrial and…
0 runs0 likes0 downloads0 reach0 impact
768 instances - 10 features - classes - 0 missing values
ContextThisdatasetismeanttocomplementtheAustinBikesharingDatasetContentContainstheDateYYYYMMDDTempHighFHightemperatureinFahrenheitTempAvgFAveragetemperatureinFahrenheitTempLowFLowtemperatureinFahrenheitDewPointHighFHighdewpointinFahrenheitDewPointAvgFAveragedewpointinFahrenheitDewPointLowFLowdewpointinFahrenheitHumidityHighPercentHighhumidityasapercentageHumidityAvgPercentAveragehumidityasapercentageHumidityLowPercentLowhumidityasapercentageSeaLevelPressureHighInchesHighsealevelpressureininchesSeaLevelPressureAvgInchesAveragesealevelpressureininchesSeaLevelPressureLowInchesLowsealevelpressureininchesVisibilityHighMilesHighvisibilityinmilesVisibilityAvgMilesAveragevisibilityinmilesVisibilityLowMilesLowvisibilityinmilesWindHighMPHHighwindspeedinmilesperhourWindAvgMPHAveragewindspeedinmilesperhourWindGustMPHHighestwindspeedgustinmilesperhourPrecipitationSumInchesTotalprecipitationininchesTifTraceEventsAdverseweathereventsifNoneThisdatasetcontainsdataforeverydatefrom20131221to20170731AcknowledgementsThisdatasetwasobtainedfromWeatherUndergroundcomattheAustinKATTstationInspirationCanweusethisdatasettoexplainsomeofthevariationintheAustinBikesharingDataset…
0 runs0 likes0 downloads0 reach0 impact
1319 instances - 21 features - classes - 0 missing values
CryptocurrenciesCryptocurrenciesarefastbecomingrivalstotraditionalcurrencyacrosstheworldThedigitalcurrenciesareavailabletopurchaseinmanydifferentplacesmakingitaccessibletoeveryoneandwithretailersacceptingvariouscryptocurrenciesitcouldbeasignthatmoneyasweknowitisabouttogothroughamajorchangeInadditiontheblockchaintechnologyonwhichmanycryptocurrenciesarebasedwithitsrevolutionarydistributeddigitalbackbonehasmanyotherpromisingapplicationsImplementationsofsecuredecentralizedsystemscanaidusinconqueringorganizationalissuesoftrustandsecuritythathaveplaguedoursocietythroughouttheagesIneffectwecanfundamentallydisruptindustriescoretoeconomiesbusinessesandsocialstructureseliminatinginefficiencyandhumanerrorContentThedatasetcontainsallhistoricaldailypricesopenhighlowcloseforallcryptocurrencieslistedonCoinMarketCapAcknowledgementsEveryCryptocurrencyDailyMarketPriceIinitiallydevelopedkernelsforthisdatasetbeforemakingmyownscraperanddatasetsothatIcouldkeepitregularlyupdatedCoinMarketCapForthedata…
0 runs0 likes0 downloads0 reach0 impact
632218 instances - 9 features - classes - 69712 missing values
ContextTheconsumercreditdepartmentofabankwantstoautomatethedecisionmakingprocessforapprovalofhomeequitylinesofcreditTodothistheywillfollowtherecommendationsoftheEqualCreditOpportunityActtocreateanempiricallyderivedandstatisticallysoundcreditscoringmodelThemodelwillbebasedondatacollectedfromrecentapplicantsgrantedcreditthroughthecurrentprocessofloanunderwritingThemodelwillbebuiltfrompredictivemodelingtoolsbutthecreatedmodelmustbesufficientlyinterpretabletoprovideareasonforanyadverseactionsrejectionsContentTheHomeEquitydatasetHMEQcontainsbaselineandloanperformanceinformationfor5960recenthomeequityloansThetargetBADisabinaryvariableindicatingwhetheranapplicanteventuallydefaultedorwasseriouslydelinquentThisadverseoutcomeoccurredin1189cases20Foreachapplicant12inputvariableswererecordedAcknowledgementsInspirationWhatifyoucanpredictclientswhodefaultontheirloans…
0 runs0 likes0 downloads0 reach0 impact
5960 instances - 13 features - classes - 5271 missing values
ContextThis dataset is meant to complement the Austin Bikesharing Dataset.ContentContains the Date YYYYMMDD TempHighF High temperature, in Fahrenheit TempAvgF Average temperature, in Fahrenheit…
0 runs0 likes0 downloads0 reach0 impact
1319 instances - 21 features - classes - 0 missing values
CryptocurrenciesCryptocurrencies are fast becoming rivals to traditional currency across the world. The digital currencies are available to purchase in many different places, making it accessible to…
0 runs0 likes0 downloads0 reach0 impact
632218 instances - 9 features - classes - 69712 missing values
Context This dataset is meant to complement the Austin Bikesharing Dataset. Content Contains the: Date (YYYY-MM-DD) TempHighF (High temperature, in Fahrenheit) TempAvgF (Average temperature, in…
0 runs0 likes0 downloads0 reach0 impact
1319 instances - 21 features - classes - 0 missing values
mini insect example dataset # 1
0 runs0 likes0 downloads0 reach0 impact
12 instances - 4 features - 4 classes - 0 missing values
MINI - Birds dataset
0 runs0 likes0 downloads0 reach0 impact
500 instances - 3 features - 20 classes - 0 missing values
mini insect example dataset # 1
0 runs0 likes0 downloads0 reach0 impact
12 instances - 4 features - 4 classes - 0 missing values
ContextWhilebrainstormingideasforastatisticsprojectforacourselastsemestertheideaofutilizingdataaboutmicrobreweriescameupUnfortunatelyaftersomeexplorationandthoughtwescrappedtheideabutIvegotthisdatatoshowforitanywayContentDatawasacquiredfromhttpswwwbeermonthclubcomandcontainsinformationaboutbreweriesavailableonthesiteasofOctober1st2019NametypeofbreweryaddresswebsiteandstateareamongthedataincludedCodeusedtoscrapeisavailableathttpsgithubcombrkurzawabrewpubscraperAcknowledgementsThankstobeermonthclubforallowingthebrewpubsectiontobescrapedInspirationThisdataaloneisnttooinspiringbutmaybeitcouldbeusedasthebasisforananalysisprojectthatlooksatinformationsuchasyelpreviewsornumberoflinkstoabreweryswebsite…
0 runs0 likes0 downloads0 reach0 impact
2407 instances - 6 features - classes - 8 missing values
MINI - Birds dataset
0 runs0 likes0 downloads0 reach0 impact
500 instances - 3 features - 20 classes - 0 missing values
Context The file contains over 11,000 tweets associated with disaster keywords like crash, quarantine, and bush fires as well as the location and keyword itself. The data structure was inherited from…
0 runs0 likes0 downloads0 reach0 impact
11370 instances - 5 features - classes - 3535 missing values
Acknowledgement Thanks to The Concept Center for publishing this article, an analysis, and pulling the data. Background Shark Tank is an ABC show where entrepreneurs around the United States can pitch…
0 runs0 likes0 downloads0 reach0 impact
495 instances - 19 features - classes - 110 missing values
Context A dataset I used to classify tweets about my company. I took tweets and I classified them manually as positive, negative or neutral. Content There are 4 columns : Id : the tweed id, unique.…
0 runs0 likes0 downloads0 reach0 impact
1097 instances - 4 features - classes - 0 missing values
Context Intel Corporation designs, manufactures, and sells essential technologies for the cloud, smart, and connected devices worldwide. The company operates through DCG, IOTG, Mobileye, NSG, PSG,…
0 runs0 likes0 downloads0 reach0 impact
10361 instances - 7 features - classes - 0 missing values
Context In the world of Pokmon academia, one name towers above any other Professor Samuel Oak. While his colleague Professor Elm specializes in Pokmon evolution, Oak has dedicated his career to…
0 runs0 likes0 downloads0 reach0 impact
801 instances - 14 features - 0 classes - 138 missing values
Description This is a countrywide weather events dataset that includes 6.3 million events, and covers 49 states of the United States. Examples of weather events are rain, snow, storm, and freezing…
0 runs0 likes0 downloads0 reach0 impact
7479165 instances - 14 features - classes - 73797 missing values
Context This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective is to predict based on diagnostic measurements whether a patient has…
0 runs0 likes0 downloads0 reach0 impact
768 instances - 9 features - 0 classes - 0 missing values
This dataset contains Apple's (AAPL) stock data for the last 10 years (from 2010 to date). I believe insights from this data can be used to build useful price forecasting algorithms to aid investment.…
0 runs0 likes0 downloads0 reach0 impact
2518 instances - 6 features - classes - 0 missing values
Context This dataset contains audio statistics of the top 2000 tracks on Spotify. The data contains about 15 columns each describing the track and it's qualities. Songs released from 1956 to 2019 are…
0 runs0 likes0 downloads0 reach0 impact
1994 instances - 15 features - classes - 0 missing values
Content This dataset identifies hazardous areas for driving according to harsh braking and accident level events within a specific area. Each month a new set of dangerous driving areas is produced and…
0 runs0 likes0 downloads0 reach0 impact
10939 instances - 18 features - classes - 6089 missing values
Context WallStreetBets (r/wallstreetbets, also known as WSB), is a subreddit where participants discuss stock and option trading. It has become notable for its profane nature and allegations of users…
0 runs0 likes0 downloads0 reach0 impact
53187 instances - 7 features - classes - 28655 missing values
Please, If you enjoyed this dataset, don't forget to upvote it. Content This dataset contains a couple of shows and series are available on Disney+ stream service. Also, this dataset contains Internet…
0 runs0 likes0 downloads0 reach0 impact
992 instances - 19 features - classes - 3433 missing values
Context Craigslist is the world's largest collection of privately sold housing options, yet it's very difficult to collect all of them in the same place. I built this dataset as a means in by which to…
0 runs0 likes0 downloads0 reach0 impact
384977 instances - 22 features - classes - 223551 missing values
Context Social media is a vast pool of content, and among all the content available for users to access, news is an element that is accessed most frequently. These news can be posted by politicians,…
0 runs0 likes0 downloads0 reach0 impact
2096 instances - 12 features - classes - 104 missing values
Oranges vs. Grapefruit The task of separating oranges and grapefruit is fairly obvious to a human, but even with manual observation there is still a bit of error. This dataset takes the color, weight,…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 6 features - classes - 0 missing values
Context Cinema industry is not excluded of getting advantage of predictive modeling. Like other industry it can help cinemas for cost reduction and better ROI. By forecasting sale, screening in…
0 runs0 likes0 downloads0 reach0 impact
142524 instances - 14 features - classes - 250 missing values
Introduction On September 27 1994 the ferry Estonia set sail on a night voyage across the Baltic Sea from the port of Tallin in Estonia to Stockholm. She departed at 19.00 carrying 989 passengers and…
0 runs0 likes0 downloads0 reach0 impact
989 instances - 7 features - 0 classes - 0 missing values
Context Towards Data Science Inc. is a corporation registered in Canada as a platform for thousands of people to exchange ideas and to expand understanding of data science. This dataset contains data…
0 runs0 likes0 downloads0 reach0 impact
46079 instances - 8 features - classes - 3 missing values
Context As Al Hilal football club is one of the most successful football clubs in Saudi Arabia, Middle East and Asia this dataset was an excercise for Web Scraping where i collected the archive of the…
0 runs0 likes0 downloads0 reach0 impact
1363 instances - 8 features - classes - 219 missing values
Description: Pulsars are a rare type of Neutron star that produce radio emission detectable here on Earth. They are of considerable scientific interest as probes of space-time, the interstellar…
0 runs0 likes0 downloads0 reach0 impact
17897 instances - 9 features - classes - 0 missing values
Context This is a dataset i generated during a hackathon for project purpose. Here i have scrapped data from Coursera official web site. Our project aims to help any new learner get the right course…
0 runs0 likes0 downloads0 reach0 impact
891 instances - 7 features - classes - 0 missing values
Context Since as a beginner in machine learning it would be a great opportunity to try some techniques to predict the outcome of the drugs that might be accurate for the patient. Content The target…
0 runs0 likes0 downloads0 reach0 impact
200 instances - 6 features - classes - 0 missing values
Context Retail dataset of a global superstore for 4 years. Perform EDA and Predict the sales of the next 7 days from the last date of the Training dataset! Content Time series analysis deals with time…
0 runs0 likes0 downloads0 reach0 impact
9800 instances - 18 features - classes - 11 missing values
Content Churn for bank customers RowNumbercorresponds to the record (row) number and has no effect on the output. CustomerIdcontains random values and has no effect on customer leaving the bank.…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 14 features - classes - 0 missing values
Context Ethereum a decentralized, open-source blockchain featuring smart contract functionality was proposed in 2013 by programmer Vitalik Buterin. Development was crowdfunded in 2014, and the network…
0 runs0 likes0 downloads0 reach0 impact
2202 instances - 6 features - classes - 0 missing values
Context I am developing my data science skills in areas outside of my previous work. An interesting problem for me was to identify which factors influence life expectancy on a national level. There is…
0 runs0 likes0 downloads0 reach0 impact
3111 instances - 32 features - classes - 15246 missing values
This dataset contains the daily currency exchange rates as reported to the International Monetary Fund by the issuing central bank. Included are 51 currencies over the period from 01-01-1995 to…
0 runs0 likes0 downloads0 reach0 impact
5978 instances - 52 features - classes - 61189 missing values
Context In the last few years, Twitter became one of the most popular social media platforms. From celebrity status to government policies, Twitter can accommodate a diverse range of people and…
0 runs0 likes0 downloads0 reach0 impact
31115 instances - 5 features - classes - 456 missing values
Context Japanese animation, which is known as anime, has become internationally widespread nowadays. This dataset provides data on anime taken from Anime News Network. Content This dataset consists of…
0 runs0 likes0 downloads0 reach0 impact
1563 instances - 20 features - classes - 0 missing values
Context - World Health Organization (WHO) Coronavirus disease (COVID-19) is an infectious disease caused by a newly discovered coronavirus (2019-nCoV). Content This dataset has information on the…
0 runs0 likes0 downloads0 reach0 impact
37272 instances - 6 features - classes - 0 missing values
Context This dataset has an exhaustive list of player statistics for each season from 2013 - 2020 Content Each row is associated with a player and a season. Eg. You will have 7 rows for Lionel Messi:…
0 runs0 likes0 downloads0 reach0 impact
5733 instances - 109 features - classes - 187897 missing values
Context The data obtained from the Mexico's General Direction of Epidemiology contains multiple information on the current pandemic situation. However, these data are saturated with features that may…
0 runs0 likes0 downloads0 reach0 impact
92320 instances - 7 features - classes - 0 missing values
Context League of Legends is a MOBA (multiplayer online battle arena) where 2 teams (blue and red) face off. There are 3 lanes, a jungle, and 5 roles. The goal is to take down the enemy Nexus to win…
0 runs0 likes0 downloads0 reach0 impact
242572 instances - 59 features - classes - 0 missing values
Statistics on a Blockbreaker-like Game The author is in the process of creating a blockbreaker-like game, in which the jumping-off point is the "Block Breaker" section of the Udemy course, Complete C…
0 runs0 likes0 downloads0 reach0 impact
6814 instances - 7 features - classes - 0 missing values
Context Crime rates vary across space and time. The reasons crimes are committed in some places but not others can be difficult to detect because of complex socio-economic factors, but policymakers…
0 runs0 likes0 downloads0 reach0 impact
1008 instances - 38 features - classes - 312 missing values
Context https://www.kaggle.com/c/birdsong-recognition Content This is information captured about birds recording from France Acknowledgements We wouldn't be here without the help of many contributors…
0 runs0 likes0 downloads0 reach0 impact
20851 instances - 26 features - classes - 32566 missing values
P2P Lending I concatenated historical loans from both Prosper and Lending Club 2013 - 2018. Currently only the summary of the loan (terms, origination date, loan amount, status, etc) are up but…
0 runs0 likes0 downloads0 reach0 impact
2875146 instances - 18 features - classes - 863078 missing values
Context COVID19 is spreading across the globe and this data set help in analyzing to what extent the pandemic has affected different countries. Content This data set contains the total number of COVID…
0 runs0 likes0 downloads0 reach0 impact
26355 instances - 6 features - classes - 844 missing values
Content This data is an extract from a bigger reddit dataset (All reddit comments from May 2019, 157Gb or data uncompressed) that contains both more comments and more associated informations…
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 4 features - classes - 1 missing values
Context Daily price information for stocks, aggregated into one big file. Content Data was pulled using an api and contains general price information for all stocks that are tradable. Fields include…
0 runs0 likes0 downloads0 reach0 impact
7801920 instances - 7 features - classes - 0 missing values
This dataset has been scrapped off sa.aqar.fm to obtain land information such as price, size, street width, and locations. The uncleaned dataset scrapped 4347 rows, but seems like 1395 were duplicated…
0 runs0 likes0 downloads0 reach0 impact
2951 instances - 8 features - classes - 8085 missing values
Context "In the Bechdel Cast, the question asked, do movies have women in them? Are all their discussions involve boyfriends or husbands or do they have individualism? The Patriarchy's vast. Let's…
0 runs0 likes0 downloads0 reach0 impact
194 instances - 14 features - classes - 20 missing values
GoodReads Choice Awards Every year GoodReads announces a contest for the best book of the year called GoodReads Choice Awards. Every year there are multiple categories and every reader can vote for a…
0 runs0 likes0 downloads0 reach0 impact
3660 instances - 16 features - classes - 223 missing values
Context Every year Kaggle conducts an industry-wide survey that presents a truly comprehensive view of the state of data science and machine learning. This dataset combines the data from the past 4…
0 runs0 likes0 downloads0 reach0 impact
80327 instances - 12 features - classes - 226439 missing values
Context There's a story behind every dataset and here's your opportunity to share yours. Content What's inside is more than just rows and columns. Make it easy for others to get started by describing…
0 runs0 likes0 downloads0 reach0 impact
205 instances - 23 features - classes - 10 missing values
Context Crude oil price from Yahoo Finance as of 1 February 2021, 10:19PM EST. Market open. NY Mercantile - NY Mercantile Delayed Price. Currency in USD. Content Open: price at market opening High:…
0 runs0 likes0 downloads0 reach0 impact
6274 instances - 7 features - classes - 7044 missing values
Context All credit of this database goes to Tim Sevenhuysen of OraclesElixir.com. Im just uploading it here because I want to see what you guys do with this dataset before Worlds! Im super hyped!…
0 runs0 likes0 downloads0 reach0 impact
67980 instances - 103 features - classes - 1527593 missing values
Context This dataset is a mixage of Fifa 19 Football Player Dataset and Real World Statistics about of them. Content You can find whatever you want from a soccer data like Goals,Assists,Preferred foot…
0 runs0 likes0 downloads0 reach0 impact
491 instances - 53 features - classes - 0 missing values
The Story This data set was part of my online course material for Data Analysis using Python over at Udemy. The Contents The dataset is very useful for beginners and novice number crunchers looking to…
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 13 features - classes - 0 missing values
Context Explore an environmental conditions dataframe scraped from CIMIS weather stations using a selenium chromedriver. With California's wildfires setting records in 2020, it is worthwhile to…
0 runs0 likes0 downloads0 reach0 impact
128125 instances - 19 features - 0 classes - 138 missing values
Among all industries, insurance domain has the largest use of analytics data science methods. This data set would provide you enough taste of working on data sets from insurance companies, what…
0 runs0 likes0 downloads0 reach0 impact
614 instances - 12 features - 2 classes - 149 missing values
Context Models of human cognition hold that information processing occurs in a series of stages. Cognitive psychology, in particular, is concerned with the internal mental processes that begin with…
0 runs0 likes0 downloads0 reach0 impact
6854 instances - 23 features - classes - 16924 missing values
Context Based on 250K lyrics database. Created to perform Supervised NLP sentiment analysis task using Spotify valence audio feature, a measure of the positiveness of the song. Content Preparation of…
0 runs0 likes0 downloads0 reach0 impact
158353 instances - 5 features - classes - 2 missing values
This data set consists of Placement data, of students in a XYZ campus. It includes secondary and higher secondary school percentage and specialisation. It also includes degree specialisation, type and…
0 runs0 likes0 downloads0 reach0 impact
215 instances - 15 features - classes - 67 missing values
Context The No Show problem is one of the bigest on the health industry, about 30 of the patient fail theirs appointments. Content 61K points, from 2017.01.01 to 2017.04.30 and 19 features to work…
0 runs0 likes0 downloads0 reach0 impact
61214 instances - 19 features - 0 classes - 0 missing values
Context Last time I built an LSTM price prediction for Corn, but the result is not satisfactory. I would like to try other algorithm and data. So I decided to use Wheat price for the exercise. And…
0 runs0 likes0 downloads0 reach0 impact
2272 instances - 5 features - classes - 0 missing values
Content This database contains six basic emotions (happiness, surprise, anger, fear, disgust, and sadness) of normalized (average mean reference) data and collected from 85 undergraduate university…
0 runs0 likes0 downloads0 reach0 impact
190967 instances - 11 features - classes - 0 missing values
Context The IMDB Movies Dataset contains information about 5834 movies. Information about these movies was scraped from imdb for purpose of creating a movie recommendation model. The data was…
0 runs0 likes0 downloads0 reach0 impact
5834 instances - 12 features - classes - 121 missing values
It is estimated that 10,000 people die each year worldwide due to hurricanes and tropical storms. The majority of human deaths are caused by flooding. Hurricane Irma hit Florida as a Category 4 storm…
0 runs0 likes0 downloads0 reach0 impact
5021 instances - 23 features - 2 classes - 91 missing values
Context Finding similarities between different types of public figures based in their Twitter activity. . Content usuario - username op = Openness to experience co =Conscientiousness ex = Extraversion…
0 runs0 likes0 downloads0 reach0 impact
140 instances - 8 features - classes - 0 missing values
Problem Statement Welcome to Sigma Cab Private Limited - a cab aggregator service. Their customers can download their app on smartphones and book a cab from any where in the cities they operate in.…
0 runs0 likes0 downloads0 reach0 impact
131662 instances - 14 features - classes - 137546 missing values
Global Cause of the Deaths other than diseases This data was part of the project Global Disease Burden 2017. Data contain the number of deaths within a country and each year along with cause of deaths…
0 runs0 likes0 downloads0 reach0 impact
36860 instances - 10 features - classes - 47180 missing values
Dataset is from http://tomslee.net/airbnb-data-collection-get-the-data room_id: A unique number identifying an Airbnb listing. The listing has a URL on the Airbnb web site of…
0 runs0 likes0 downloads0 reach0 impact
13578 instances - 20 features - classes - 54347 missing values
Context I built it on https://www.kaggle.com/zynicide/wine-reviews updating the scraper fetching fresh data. Content All wine reviews from winemag.com for 2017-2020 years. Duplicates are cleared,…
0 runs0 likes0 downloads0 reach0 impact
81115 instances - 15 features - classes - 90161 missing values
Context In the dataset freMTPL2freq risk features and claim numbers were collected for 677,991 motor third-part liability policies (observed on a year). Content freMTPL2freq contains 11 columns…
0 runs0 likes0 downloads0 reach0 impact
678013 instances - 11 features - classes - 0 missing values
Context The World Happiness Ranking focuses on the social, urban, and natural environment. Specifically, the ranking relies on self-reports from residents of how they weigh the quality of life they…
0 runs0 likes0 downloads0 reach0 impact
153 instances - 20 features - classes - 0 missing values
This dataset is the resulting cleaned version of Panos Kostakos's Risk of being drawn into online sex work dataset. Context This database was used in the paper: "Covert online ethnography and machine…
0 runs0 likes0 downloads0 reach0 impact
28831 instances - 30 features - classes - 54259 missing values
Context Cytology features of breast cancer biopsy. It can be used to predict breast cancer from cytology features. The data was obtained from…
0 runs0 likes0 downloads0 reach0 impact
699 instances - 11 features - classes - 16 missing values
These data are the results of a chemical analysis of wines grown in the same region in Italy but derived from three different cultivars. The analysis determined the quantities of 13 constituents found…
0 runs0 likes0 downloads0 reach0 impact
178 instances - 14 features - classes - 2 missing values
Context Tumor microRNA expression profiling identifies circulating microRNAs for earlier breast cancer detection. Due to microRNA role in tumorigenesis and remarkable stability in body fluids,…
0 runs0 likes0 downloads0 reach0 impact
133 instances - 1928 features - classes - 0 missing values
Naturally, I am myself an NBA fan and I don't prefer when I have data that has missing information. But not missing as like missing instance in columns, I mean the whole columns that I could use in my…
0 runs0 likes0 downloads0 reach0 impact
54514 instances - 142 features - classes - 0 missing values
Context Train your project by getting weather data from Istanbul. Content You can find daily weather, rainfall, sunrise time, sunset time, moonrise time, sunset time, average wind speed, average…
0 runs0 likes0 downloads0 reach0 impact
3896 instances - 12 features - classes - 263 missing values
Context METAR is a format for reporting weather information. A METAR weather report is predominantly used by pilots in fulfillment of a part of a pre-flight weather briefing, and by meteorologists,…
0 runs0 likes0 downloads0 reach0 impact
8787 instances - 13 features - classes - 6960 missing values
Context This is the dataset used in the second chapter of Aurlien Gron's recent book 'Hands-On Machine learning with Scikit-Learn and TensorFlow'. It serves as an excellent introduction to…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 10 features - classes - 207 missing values
Uber vs Lyft This is a very beginner-friendly dataset. It does contain a lot of NA values. It is a good dataset if you want to use a Linear Regression Model to see the pattern between different…
0 runs0 likes0 downloads0 reach0 impact
693071 instances - 56 features - classes - 55095 missing values
Context Census of India is a rich database which can tell stories of over a billion Indians. It is important not only for research point of view, but commercially as well for the organizations that…
0 runs0 likes0 downloads0 reach0 impact
590 instances - 82 features - classes - 3219 missing values
Context At Shadow Robot, we are leaders in robotic grasping and manipulation. As part of our Smart Grasping System development, we're developing different algorithms using machine learning. This first…
0 runs0 likes0 downloads0 reach0 impact
992641 instances - 29 features - classes - 0 missing values
Context This dataset contains information on all 802 Pokemon from all Seven Generations of Pokemon. The information contained in this dataset include Base Stats, Performance against Other Types,…
0 runs0 likes0 downloads0 reach0 impact
801 instances - 41 features - classes - 522 missing values
Context World fact sheet, fun to link with other datasets. Content Information on population, region, area size, infant mortality and more. Acknowledgements Source: All these data sets are made up of…
0 runs0 likes0 downloads0 reach0 impact
227 instances - 20 features - classes - 110 missing values
Context The ecological footprint measures the ecological assets that a given population requires to produce the natural resources it consumes (including plant-based food and fiber products, livestock…
0 runs0 likes0 downloads0 reach0 impact
188 instances - 21 features - classes - 181 missing values
Context Pantheon is a project celebrating the cultural information that endows our species with these fantastic capacities. To celebrate our global cultural heritage we are compiling, analyzing and…
0 runs0 likes0 downloads0 reach0 impact
11341 instances - 17 features - classes - 11329 missing values
Overview Identification COUNTRY Rwanda TITLE Integrated Household Living Conditions Survey 2010-2011 TRANSLATED TITLE Enqute Intgrale sur les conditions de vie des mnages 2010-2011 STUDY TYPE…
0 runs0 likes0 downloads0 reach0 impact
78432 instances - 23 features - classes - 907612 missing values
Acknowledgements The data was scraped from Booking.com. All data in the file is publicly available to everyone already. Data is originally owned by Booking.com. Please contact me through my profile if…
0 runs0 likes0 downloads0 reach0 impact
515738 instances - 17 features - classes - 6536 missing values
Context This is a dataset for a larger project I have been working on. My idea is to analyze and compare real historical weather with weather folklore. Content The CSV file includes a hourly/daily…
0 runs0 likes0 downloads0 reach0 impact
96453 instances - 12 features - classes - 517 missing values
Context Some camera enthusiast went and described 1,000 cameras based on 13 properties! Content Row one describes the datatype for each column and can probably be removed. The 13 properties of each…
0 runs0 likes0 downloads0 reach0 impact
1038 instances - 13 features - 0 classes - 7 missing values