OpenML
Filter results by:
Context Microstrip Antennas are low-profile antennas applied in high-performance aircraft, spacecraft, satellite and missile applications, where size, weight, performance, ease of installation and…
0 runs0 likes0 downloads0 reach0 impact
572 instances - 13 features - classes - 63 missing values
Context This is a forked subset of the Uber Pickups in New York City, enriched with weather, borough, and holidays information. Content I created this dataset for a personal project of exploring and…
0 runs0 likes0 downloads0 reach0 impact
29101 instances - 13 features - classes - 3043 missing values
The data comprises the brand, type and model. For each car we have information about the cubic capacity (in cm3), the maximal power of engine (in kW), the maximal torque (in Nm), the number of seats,…
0 runs0 likes0 downloads0 reach0 impact
475 instances - 13 features - classes - 24 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on categorical and numerical features"…
0 runs0 likes0 downloads0 reach0 impact
4970 instances - 13 features - 2 classes - 0 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
Annual salary information including gross pay and overtime pay for all active, permanent employees of Montgomery County, MD paid in calendar year 2016. This information will be published annually each…
0 runs0 likes0 downloads0 reach0 impact
9228 instances - 13 features - 0 classes - 11169 missing values
This dataset consists of beer reviews from Beeradvocate. The data span a period of more than 10 years, including all ~1.5 million reviews up to November 2011. Each review includes ratings in terms of…
0 runs0 likes0 downloads0 reach0 impact
1586614 instances - 13 features - 104 classes - 68148 missing values
Forest Fires Data Set This is a difficult regression task, where the aim is to predict the burned area of forest fires, in the northeast region of Portugal, by using meteorological and other data.…
0 runs0 likes0 downloads0 reach0 impact
517 instances - 13 features - 0 classes - 0 missing values
Conventional and Social Media Movies (CSM) - Dataset 2014 and 2015 Data Set 12 features categorized as conventional and social media features. Both conventional features, collected from movies…
0 runs0 likes0 downloads0 reach0 impact
231 instances - 13 features - classes - 46 missing values
This dataset contains 10962 houses to rent with 13 diferent features. Some values in the dataset can be considered as outliers for further analyses. Bear in mind that the Web Crawler was used only to…
0 runs0 likes0 downloads0 reach0 impact
10692 instances - 13 features - 0 classes - 0 missing values
We scraped a large number of eBay auctions of a popular product. After preprocessing the auction data, we build the SB dataset. The goal is to share the labelled SB dataset with the researchers.
0 runs0 likes0 downloads0 reach0 impact
6321 instances - 13 features - 0 classes - 0 missing values
This hourly data set contains the PM2.5 data of US Embassy in Beijing. Meanwhile, meteorological data from Beijing Capital International Airport are also included.
0 runs0 likes0 downloads0 reach0 impact
43824 instances - 13 features - classes - 2067 missing values
ThedatahasbeenreleasedbySDSNandextractedbyPromptCloudscustomwebcrawlingsolutionContextTheWorldHappinessReportisalandmarksurveyofthestateofglobalhappinessthatranks156countriesbyhowhappytheircitizensperceivethemselvestobeThisyearsWorldHappinessReportfocusesonhappinessandthecommunityhowhappinesshasevolvedoverthepastdozenyearswithafocusonthetechnologiessocialnormsconflictsandgovernmentpoliciesthathavedriventhosechangesContentWhatisDystopiaDystopiaisanimaginarycountrythathastheworldsleasthappypeopleThepurposeinestablishingDystopiaistohaveabenchmarkagainstwhichallcountriescanbefavorablycomparednocountryperformsmorepoorlythanDystopiaintermsofeachofthesixkeyvariablesthusallowingeachsubbartobeofpositiveorzeroinsixinstanceswidthThelowestscoresobservedforthesixkeyvariablesthereforecharacterizeDystopiaSincelifewouldbeveryunpleasantinacountrywiththeworldslowestincomeslowestlifeexpectancylowestgenerositymostcorruptionleastfreedomandleastsocialsupportitisreferredtoasDystopiaincontrasttoUtopiaWhataretheresidualsTheresidualsorunexplainedcomponentsdifferforeachcountryreflectingtheextenttowhichthesixvariableseitheroverorunderexplainaverage20162018lifeevaluationsTheseresidualshaveanaveragevalueofapproximatelyzerooverthewholesetofcountriesFigure27showstheaverageresidualforeachcountryiftheequationinTable21isappliedtoaverage20162018dataforthesixvariablesinthatcountryWecombinetheseresidualswiththeestimateforlifeevaluationsinDystopiasothatthecombinedbarwillalwayshavepositivevaluesAscanbeseeninFigure27althoughsomelifeevaluationresidualsarequitelargeoccasionallyexceedingonepointonthescalefrom0to10theyarealwaysmuchsmallerthanthecalculatedvalueinDystopiawheretheaveragelifeisratedat188onthe0to10scaleTable7oftheonlineStatisticalAppendix1forChapter2putstheDystopiaplusresidualblockattheleftsideandalsodrawstheDystopialinemakingiteasytocomparethesignsandsizesoftheresidualsindifferentcountriesWhydoweusethesesixfactorstoexplainlifeevaluationsThevariablesusedreflectwhathasbeenbroadlyfoundintheresearchliteraturetobeimportantinexplainingnationalleveldifferencesinlifeevaluationsSomeimportantvariablessuchasunemploymentorinequalitydonotappearbecausecomparableinternationaldataarenotyetavailableforthefullsampleofcountriesThevariablesareintendedtoillustrateimportantlinesofcorrelationratherthantoreflectcleancausalestimatessincesomeofthedataaredrawnfromthesamesurveysourcessomearecorrelatedwitheachotherorwithotherimportantfactorsforwhichwedonothavemeasuresandinseveralinstancestherearelikelytobetwowayrelationsbetweenlifeevaluationsandthechosenvariablesforexamplehealthypeopleareoverallhappierbutasChapter4intheWorldHappinessReport2013demonstratedhappierpeopleareoverallhealthierInStatisticalAppendix1ofWorldHappinessReport2018weassessedthepossibleimportanceofusingexplanatorydatafromthesamepeoplewhoselifeevaluationsarebeingexplainedWedidthisbyrandomlydividingthesamplesintotwogroupsandusingtheaveragevaluesforegfreedomgleanedfromonegrouptoexplainthelifeevaluationsoftheothergroupThisloweredtheeffectsbutonlyveryslightlyeg2to3assuringusthatusingdatafromthesameindividualsisnotseriouslyaffectingtheresultsDatasourcehttpworldhappinessreported2019MoresuchdatasetscanbedownloadedfromDataStock…
0 runs0 likes0 downloads0 reach0 impact
1038 instances - 13 features - classes - 7 missing values
Context The consumer credit department of a bank wants to automate the decisionmaking process for approval of home equity lines of credit. To do this, they will follow the recommendations of the Equal…
0 runs0 likes0 downloads0 reach0 impact
5960 instances - 13 features - classes - 5271 missing values
ContextTheconsumercreditdepartmentofabankwantstoautomatethedecisionmakingprocessforapprovalofhomeequitylinesofcreditTodothistheywillfollowtherecommendationsoftheEqualCreditOpportunityActtocreateanempiricallyderivedandstatisticallysoundcreditscoringmodelThemodelwillbebasedondatacollectedfromrecentapplicantsgrantedcreditthroughthecurrentprocessofloanunderwritingThemodelwillbebuiltfrompredictivemodelingtoolsbutthecreatedmodelmustbesufficientlyinterpretabletoprovideareasonforanyadverseactionsrejectionsContentTheHomeEquitydatasetHMEQcontainsbaselineandloanperformanceinformationfor5960recenthomeequityloansThetargetBADisabinaryvariableindicatingwhetheranapplicanteventuallydefaultedorwasseriouslydelinquentThisadverseoutcomeoccurredin1189cases20Foreachapplicant12inputvariableswererecordedAcknowledgementsInspirationWhatifyoucanpredictclientswhodefaultontheirloans…
0 runs0 likes0 downloads0 reach0 impact
5960 instances - 13 features - classes - 5271 missing values
The Story This data set was part of my online course material for Data Analysis using Python over at Udemy. The Contents The dataset is very useful for beginners and novice number crunchers looking to…
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 13 features - classes - 0 missing values
Context METAR is a format for reporting weather information. A METAR weather report is predominantly used by pilots in fulfillment of a part of a pre-flight weather briefing, and by meteorologists,…
0 runs0 likes0 downloads0 reach0 impact
8787 instances - 13 features - classes - 6960 missing values
Context Some camera enthusiast went and described 1,000 cameras based on 13 properties! Content Row one describes the datatype for each column and can probably be removed. The 13 properties of each…
0 runs0 likes0 downloads0 reach0 impact
1038 instances - 13 features - 0 classes - 7 missing values
Dataset used in the tabular data benchmark https://github.com/LeoGrin/tabular-benchmark, transformed in the same way. This dataset belongs to the "classification on categorical and numerical features"…
0 runs0 likes0 downloads0 reach0 impact
4970 instances - 13 features - 2 classes - 0 missing values
tbd
0 runs0 likes0 downloads0 reach0 impact
56 instances - 13 features - 0 classes - 0 missing values
The Computer Activity databases are a collection of computer systems activity measures. The data was collected from a Sun Sparcstation 20/712 with 128 Mbytes of memory running in a multi-user…
12 runs0 likes0 downloads0 reach0 impact
8192 instances - 13 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
753 runs0 likes0 downloads0 reach0 impact
8192 instances - 13 features - 2 classes - 0 missing values
No data.
337 runs0 likes0 downloads0 reach0 impact
1000000 instances - 13 features - 3 classes - 0 missing values
#modelage
28 runs0 likes0 downloads0 reach0 impact
202 instances - 13 features - 3 classes - 202 missing values
Bike sharing systems are new generation of traditional bike rentals where whole process from membership, rental and return back has become automatic. Through these systems, user is able to easily rent…
0 runs0 likes0 downloads0 reach0 impact
17379 instances - 13 features - 0 classes - 0 missing values
Bike sharing systems are new generation of traditional bike rentals where whole process from membership, rental and return back has become automatic. Through these systems, user is able to easily rent…
0 runs0 likes0 downloads0 reach0 impact
17379 instances - 13 features - 0 classes - 0 missing values
Pokemons are something which fascinated me every single time. Who would believe that a 6 year old kid used to be late to school almost everyday because of watching those extra minutes of the Pokemon…
0 runs0 likes0 downloads0 reach0 impact
1045 instances - 13 features - 0 classes - 492 missing values
Context This dataset is a playground for fundamental and technical analysis. This can serve as basic tutorial for time-series data analysis. Content Dataset consists of following files:…
0 runs0 likes0 downloads0 reach0 impact
744 instances - 13 features - classes - 2 missing values
Content This is a 10692 rows x 13 columns dataset that contains the different features of a property and its rent Acknowledgements This is a synthetic Indian dataset of the original dataset that has…
0 runs0 likes0 downloads0 reach0 impact
10692 instances - 13 features - classes - 0 missing values
Context This Data set contains various information about a set of cars that were manufactured with set of factory parameters like cylinder size, number of cylinders, fuel consumption, Carbon dioxide…
0 runs0 likes0 downloads0 reach0 impact
679 instances - 13 features - classes - 0 missing values
Data Set Information: A dataset of manually-curated BCF for 779 chemicals was used to determine the mechanisms of bioconcentration, i.e. to predict whether a chemical: (1) is mainly stored within…
0 runs0 likes0 downloads0 reach0 impact
779 instances - 13 features - classes - 0 missing values
Context After the release of Pokmon Sword and Shield on the November 15, 2019, 81 new Pokemons were released alongside 13 regional variants of preexisting Pokmon. Content Like dataset from previous…
0 runs0 likes0 downloads0 reach0 impact
1017 instances - 13 features - classes - 479 missing values
Context This case is about a bank (Thera Bank) whose management wants to explore ways of converting its liability customers to personal loan customers (while retaining them as depositors). A campaign…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 13 features - classes - 0 missing values
Subsampling of the dataset rl (44160) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self, seed:…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 13 features - 2 classes - 0 missing values
Subsampling of the dataset rl (44160) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self, seed:…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 13 features - 2 classes - 0 missing values
Subsampling of the dataset rl (44160) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self, seed:…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 13 features - 2 classes - 0 missing values
Subsampling of the dataset rl (44160) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self, seed:…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 13 features - 2 classes - 0 missing values
Subsampling of the dataset rl (44160) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self, seed:…
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 13 features - 2 classes - 0 missing values
daily bike dataset
0 runs0 likes0 downloads0 reach0 impact
731 instances - 13 features - 606 classes - 0 missing values
daily bike dataset
0 runs0 likes0 downloads0 reach0 impact
731 instances - 13 features - 606 classes - 0 missing values
Relevant Information: -- The database contains 3 potential classes, one for the number of times a certain type of solar flare occured in a 24 hour period. -- Each instance represents captured features…
31 runs0 likes0 downloads0 reach0 impact
1066 instances - 13 features - 6 classes - 0 missing values
Relevant Information: -- The database contains 3 potential classes, one for the number of times a certain type of solar flare occured in a 24 hour period. -- Each instance represents captured features…
31 runs0 likes0 downloads0 reach0 impact
315 instances - 13 features - 5 classes - 0 missing values
No data.
67 runs0 likes0 downloads0 reach0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
66 runs0 likes0 downloads0 reach0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
No data.
306 runs0 likes0 downloads0 reach0 impact
1000000 instances - 13 features - 6 classes - 0 missing values
The local stability analysis of the 4-node star system (electricity producer is in the center) implementing Decentral Smart Grid Control concept was performed. This dataset contains simulations…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 13 features - 0 classes - 0 missing values
The aim of this dataset is to predict the burned area of forest fires, in the northeast region of Portugal, by using meteorological and other data. The output 'area' was first transformed with a…
0 runs0 likes0 downloads0 reach0 impact
517 instances - 13 features - 0 classes - 0 missing values
130k wine reviews with variety, location, winery, price, and description. Downloaded from Kaggle [https://www.kaggle.com/zynicide/wine-reviews/home] on 29.10.2018. The original data was scraped from…
0 runs0 likes1 downloads1 reach9 impact
129971 instances - 13 features - 0 classes - 204752 missing values
Description: The "heart_failure_clinical_records.csv" dataset comprises clinical records of patients with heart failure, detailing various medical attributes that may contribute to heart failure…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 13 features - classes - 0 missing values
Description: This dataset, named "heart_failure_clinical_records.csv", consists of clinical records of patients with heart failure. It includes various attributes such as age, anaemia, creatinine…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 13 features - classes - 0 missing values
Data on the homicide rate in Detroit for the years 1961-1973. This is the data set called DETROIT in the book 'Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series…
0 runs0 likes0 downloads0 reach0 impact
13 instances - 14 features - 0 classes - 0 missing values
The Boston house-price data of Harrison, D. and Rubinfeld, D.L. 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol.5, 81-102, 1978. Used in Belsley, Kuh & Welsch,…
9 runs0 likes0 downloads0 reach0 impact
506 instances - 14 features - 0 classes - 0 missing values
Determinants of Plasma Retinol and Beta-Carotene Levels Summary: Observational studies have suggested that low dietary intake or low plasma concentrations of retinol, beta-carotene, or other…
15 runs0 likes0 downloads0 reach0 impact
315 instances - 14 features - 0 classes - 0 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
10 runs0 likes0 downloads0 reach0 impact
294 instances - 14 features - 0 classes - 782 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
734 runs0 likes0 downloads0 reach0 impact
506 instances - 14 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
780 runs0 likes0 downloads0 reach0 impact
178 instances - 14 features - 2 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
688 runs0 likes0 downloads0 reach0 impact
294 instances - 14 features - 2 classes - 782 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
762 runs0 likes0 downloads0 reach0 impact
315 instances - 14 features - 2 classes - 0 missing values
Datasets of Data And Story Library, project illustrating use of basic statistic methods, converted to arff format by Hakan Kjellerstrand. Source: TunedIT: http://tunedit.org/repo/DASL DASL file…
0 runs0 likes0 downloads0 reach0 impact
47 instances - 14 features - 0 classes - 0 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1763 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 2 classes - 7 missing values
Publication Request: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> This file describes the contents of the heart-disease directory. This directory contains 4 databases…
1792 runs0 likes0 downloads0 reach0 impact
294 instances - 14 features - 2 classes - 782 missing values
This database contains 13 attributes (which have been extracted from a larger set of 75) Attribute Information: ------------------------ -- 1. age -- 2. sex -- 3. chest pain type (4 values) -- 4.…
3215 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - 2 classes - 0 missing values
* Donor: David W. Aha (aha '@' ics.uci.edu) (714) 856-8779 * Data Set Information: This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In…
159 runs0 likes0 downloads0 reach0 impact
200 instances - 14 features - 5 classes - 0 missing values
* Dataset: This is a reprocessed version of heart-h (hungarian), the heart disease reprocessed hungarian dataset from UCI.
138 runs0 likes0 downloads0 reach0 impact
294 instances - 14 features - 5 classes - 0 missing values
libSVM","AAD group #Dataset from the LIBSVM data repository. Preprocessing: scaled to [-1,1]
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - 0 classes - 0 missing values
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!! Cholesterol treated as the class attribute. As used by Kilpatrick, D. & Cameron-Jones, M. (1998). Numeric prediction using…
175 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 0 classes - 6 missing values
Donor: David W. Aha (aha@ics.uci.edu) This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one…
48 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - 0 classes - 6 missing values
Data from StatLib (ftp stat.cmu.edu/datasets) This is the data set called `DETROIT' in the book `Subset selection in regression' by Alan J. Miller published in the Chapman & Hall series of monographs…
2 runs0 likes0 downloads0 reach0 impact
13 instances - 14 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). The multi-class target feature is converted to a two-class nominal target feature by re-labeling the majority class as positive ('P') and…
131 runs0 likes0 downloads0 reach0 impact
990 instances - 14 features - 2 classes - 0 missing values
Of all the universities in the world, which are the best? Ranking universities is a difficult, political, and controversial practice. There are hundreds of different national and international…
0 runs0 likes0 downloads0 reach0 impact
1029 instances - 14 features - classes - 200 missing values
Content As of 2020, 6.7 million people reside in Victoria, Australia's second most populated state. Most of them, 5 million, live or work in Melbourne, state's capital. During 2020 Australia was among…
0 runs0 likes0 downloads0 reach0 impact
2106 instances - 14 features - classes - 4 missing values
Context The data set contains laboratory values of blood donors and Hepatitis C patients and demographic values like age. The data was obtained from UCI Machine Learning Repository:…
0 runs0 likes0 downloads0 reach0 impact
615 instances - 14 features - classes - 31 missing values
Context UFC 257: Poirier vs. McGregor 2 was a mixed martial arts event produced by the Ultimate Fighting Championship that took place on January 24, 2021 at the Etihad Arena on Yas Island, Abu Dhabi,…
0 runs0 likes0 downloads0 reach0 impact
40819 instances - 14 features - classes - 19084 missing values
Context TFT TFT is an 8-player free-for-all drafting tactics game in which the player recruits powerful champions, deploys them, and battles to become the last player standing. When acquired, a…
0 runs0 likes0 downloads0 reach0 impact
748928 instances - 14 features - classes - 187360 missing values
User profile data for San Francisco OkCupid users published in [Kim, A. Y., & Escobedo-Land, A. (2015). OKCupid data for introductory statistics and data science courses. Journal of Statistics…
0 runs0 likes0 downloads0 reach0 impact
26677 instances - 14 features - 3 classes - 0 missing values
Conventional and Social Media Movies (CSM) - Dataset 2014 and 2015 Data Set 12 features categorized as conventional and social media features. Both conventional features, collected from movies…
0 runs0 likes0 downloads0 reach0 impact
232 instances - 14 features - classes - 60 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
40768 instances - 14 features - classes - 0 missing values
Context This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one that has been used by ML…
0 runs0 likes0 downloads0 reach0 impact
303 instances - 14 features - classes - 0 missing values
The pandemic context brings new challenges to cities. This fantastic resource was created by Google with aggregated, anonymized sets of data from users who have turned on the Location History setting…
0 runs0 likes0 downloads0 reach0 impact
1048575 instances - 14 features - classes - 5573689 missing values
Context The Social Dilemma, a documentary-drama hybrid explores the dangerous human impact of social networking, with tech experts sounding the alarm on their own creations as the tech experts sound…
0 runs0 likes0 downloads0 reach0 impact
20068 instances - 14 features - classes - 10429 missing values
Context Winter Storm Uri in February 2021 caused havoc across the United States and specifically to Texas involving mass power outages, water and food shortages, and dangerous weather conditions. This…
0 runs0 likes0 downloads0 reach0 impact
23358 instances - 14 features - classes - 53699 missing values
Context Thinking of Natural Language Processing as a beginner!! The dataset has been about the wine comments or reviews that has been given by various wine tasters. The concept was to use text…
0 runs0 likes0 downloads0 reach0 impact
129971 instances - 14 features - classes - 204754 missing values
Historical data on avocado prices and sales volume in multiple US markets. For this version Date column is dropped and month and day information in kept.
0 runs0 likes0 downloads0 reach0 impact
18249 instances - 14 features - 0 classes - 0 missing values
This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. The goal field refers to the presence of heart disease in the patient. It is integer valued…
0 runs0 likes0 downloads0 reach0 impact
296 instances - 14 features - 0 classes - 0 missing values
The analysis is performed for different sets of input values using the methodology similar to that described in [Schafer, Benjamin, et al. 'Taming instabilities in power grid networks by decentralized…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 14 features - classes - 0 missing values
Context Data set containing results from 2019 Ironman World Championship in Kona, Hawaii to help answer the question of which countries produce the fastest overall finish times, as well as fastest…
0 runs0 likes0 downloads0 reach0 impact
2434 instances - 14 features - classes - 766 missing values
Citation Request: This dataset is public available for research. The details are described in [Cortez et al., 2009]. Please include this citation if you plan to use this database: P. Cortez, A.…
0 runs0 likes0 downloads0 reach0 impact
6497 instances - 14 features - classes - 0 missing values
These data are the results of a chemical analysis of wines grown in the same region in Italy but derived from three different cultivars. The analysis determined the quantities of 13 constituents found…
0 runs0 likes0 downloads0 reach0 impact
178 instances - 14 features - classes - 0 missing values
Source: UCI Machine Learning Repository Content A dataset of manually-curated BCF for 779 chemicals was used to determine the mechanisms of bioconcentration, i.e. to predict whether a chemical: (1) is…
0 runs0 likes0 downloads0 reach0 impact
779 instances - 14 features - classes - 0 missing values
The SATO data set used is real life data collected from a major wireless telecom operator in South Asia.
0 runs0 likes0 downloads0 reach0 impact
2000 instances - 14 features - 2 classes - 453 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
252 instances - 14 features - classes - 0 missing values
Context Imbalanced classes put accuracy out of business. This is a surprisingly common problem in machine learning (specifically in classification), occurring in datasets with a disproportionate ratio…
0 runs0 likes0 downloads0 reach0 impact
1723 instances - 14 features - 0 classes - 0 missing values
Context Cricket based datasets are not readily available for analysis on most of the portals. Hence, an effort to provide test match data for 2200+ tests after cleaning it for very special cases.…
0 runs0 likes0 downloads0 reach0 impact
8291 instances - 14 features - classes - 8291 missing values
Context this dataset is containing information about HDFC bank equity share information. Content The dataset containing a total of 7 columns Date 5151 non-null datetime64[ns] Open 5082 non-null…
0 runs0 likes0 downloads0 reach0 impact
489 instances - 14 features - classes - 0 missing values
Context I wanted a highly imbalanced dataset to share with others. It has the perfect one for us. Imbalanced data typically refers to a classification problem where the number of observations per…
0 runs0 likes0 downloads0 reach0 impact
9578 instances - 14 features - classes - 0 missing values
Dataset Description Story View the ReadMe file in my Github repo for this project. Check out all the info on my portfolio's webpage for this project. As I write this, I'm a Data Science student. To…
0 runs0 likes0 downloads0 reach0 impact
9534417 instances - 14 features - classes - 0 missing values
Data are collected from Kickstarter Platform You'll find most useful data for project analysis. Columns are self explanatory except: usd_pledged: conversion in US dollars of the pledged column…
0 runs0 likes0 downloads0 reach0 impact
331675 instances - 14 features - classes - 210 missing values
test
0 runs0 likes0 downloads0 reach0 impact
270 instances - 14 features - classes - 0 missing values