OpenML
Filter results by:
Context The World Health Organization (WHO) declared the 201920 coronavirus outbreak a pandemic and a Public Health Emergency of International Concern (PHEIC). Evidence of local transmission of the…
0 runs0 likes0 downloads0 reach0 impact
1010 instances - 7 features - classes - 8 missing values
Context I am currently learning Data Science concepts so I started my journey by doing some basic data visualisations. While I was looking for datasets online to visualise, I saw a Pokemon dataset. I…
0 runs0 likes0 downloads0 reach0 impact
1013 instances - 40 features - classes - 0 missing values
Context After the release of Pokmon Sword and Shield on the November 15, 2019, 81 new Pokemons were released alongside 13 regional variants of preexisting Pokmon. Content Like dataset from previous…
0 runs0 likes0 downloads0 reach0 impact
1017 instances - 13 features - classes - 479 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 12476, and it has 1023 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1023 instances - 1026 features - 0 classes - 0 missing values
Data Sets for 'Regression Models for Time Series Analysis' by B. Kedem and K. Fokianos, Wiley 2002. Submitted by Kostas Fokianos (fokianos@ucy.ac.cy) [8/Nov/02] (176k) Note: - attribute names were…
599 runs0 likes0 downloads0 reach0 impact
1024 instances - 3 features - 4 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 20154, and it has 1024 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1024 instances - 1026 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
746 runs0 likes0 downloads0 reach0 impact
1024 instances - 3 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11720, and it has 1027 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1027 instances - 1026 features - 0 classes - 0 missing values
Of all the universities in the world, which are the best? Ranking universities is a difficult, political, and controversial practice. There are hundreds of different national and international…
0 runs0 likes0 downloads0 reach0 impact
1029 instances - 14 features - classes - 200 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10475, and it has 1030 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1030 instances - 1026 features - 0 classes - 0 missing values
Concrete is the most important material in civil engineering. The concrete compressive strength is a highly nonlinear function of age and ingredients. These ingredients include cement, blast furnace…
3 runs0 likes0 downloads0 reach0 impact
1030 instances - 9 features - classes - 0 missing values
Context This data set aims to provide a start up for the ones who just started off with deep learning and act as a benchmark. Content The feature set includes: Cement Blast Furnace Slag Fly Ash Water…
0 runs0 likes0 downloads0 reach0 impact
1030 instances - 9 features - classes - 0 missing values
Concrete is the most important material in civil engineering. The concrete compressive strength is a highly nonlinear function of age and ingredients. Each instance represents a description, different…
0 runs0 likes0 downloads0 reach0 impact
1030 instances - 9 features - 0 classes - 0 missing values
This dataset contains estimates of the socioeconomic status (SES) position of each of 149 countries covering the period 1880-2010. Measures of SES, which are in decades, allow for a 130 year…
0 runs0 likes0 downloads0 reach0 impact
1036 instances - 10 features - classes - 0 missing values
ThedatahasbeenreleasedbySDSNandextractedbyPromptCloudscustomwebcrawlingsolutionContextTheWorldHappinessReportisalandmarksurveyofthestateofglobalhappinessthatranks156countriesbyhowhappytheircitizensperceivethemselvestobeThisyearsWorldHappinessReportfocusesonhappinessandthecommunityhowhappinesshasevolvedoverthepastdozenyearswithafocusonthetechnologiessocialnormsconflictsandgovernmentpoliciesthathavedriventhosechangesContentWhatisDystopiaDystopiaisanimaginarycountrythathastheworldsleasthappypeopleThepurposeinestablishingDystopiaistohaveabenchmarkagainstwhichallcountriescanbefavorablycomparednocountryperformsmorepoorlythanDystopiaintermsofeachofthesixkeyvariablesthusallowingeachsubbartobeofpositiveorzeroinsixinstanceswidthThelowestscoresobservedforthesixkeyvariablesthereforecharacterizeDystopiaSincelifewouldbeveryunpleasantinacountrywiththeworldslowestincomeslowestlifeexpectancylowestgenerositymostcorruptionleastfreedomandleastsocialsupportitisreferredtoasDystopiaincontrasttoUtopiaWhataretheresidualsTheresidualsorunexplainedcomponentsdifferforeachcountryreflectingtheextenttowhichthesixvariableseitheroverorunderexplainaverage20162018lifeevaluationsTheseresidualshaveanaveragevalueofapproximatelyzerooverthewholesetofcountriesFigure27showstheaverageresidualforeachcountryiftheequationinTable21isappliedtoaverage20162018dataforthesixvariablesinthatcountryWecombinetheseresidualswiththeestimateforlifeevaluationsinDystopiasothatthecombinedbarwillalwayshavepositivevaluesAscanbeseeninFigure27althoughsomelifeevaluationresidualsarequitelargeoccasionallyexceedingonepointonthescalefrom0to10theyarealwaysmuchsmallerthanthecalculatedvalueinDystopiawheretheaveragelifeisratedat188onthe0to10scaleTable7oftheonlineStatisticalAppendix1forChapter2putstheDystopiaplusresidualblockattheleftsideandalsodrawstheDystopialinemakingiteasytocomparethesignsandsizesoftheresidualsindifferentcountriesWhydoweusethesesixfactorstoexplainlifeevaluationsThevariablesusedreflectwhathasbeenbroadlyfoundintheresearchliteraturetobeimportantinexplainingnationalleveldifferencesinlifeevaluationsSomeimportantvariablessuchasunemploymentorinequalitydonotappearbecausecomparableinternationaldataarenotyetavailableforthefullsampleofcountriesThevariablesareintendedtoillustrateimportantlinesofcorrelationratherthantoreflectcleancausalestimatessincesomeofthedataaredrawnfromthesamesurveysourcessomearecorrelatedwitheachotherorwithotherimportantfactorsforwhichwedonothavemeasuresandinseveralinstancestherearelikelytobetwowayrelationsbetweenlifeevaluationsandthechosenvariablesforexamplehealthypeopleareoverallhappierbutasChapter4intheWorldHappinessReport2013demonstratedhappierpeopleareoverallhealthierInStatisticalAppendix1ofWorldHappinessReport2018weassessedthepossibleimportanceofusingexplanatorydatafromthesamepeoplewhoselifeevaluationsarebeingexplainedWedidthisbyrandomlydividingthesamplesintotwogroupsandusingtheaveragevaluesforegfreedomgleanedfromonegrouptoexplainthelifeevaluationsoftheothergroupThisloweredtheeffectsbutonlyveryslightlyeg2to3assuringusthatusingdatafromthesameindividualsisnotseriouslyaffectingtheresultsDatasourcehttpworldhappinessreported2019MoresuchdatasetscanbedownloadedfromDataStock…
0 runs0 likes0 downloads0 reach0 impact
1038 instances - 13 features - classes - 7 missing values
Context Some camera enthusiast went and described 1,000 cameras based on 13 properties! Content Row one describes the datatype for each column and can probably be removed. The 13 properties of each…
0 runs0 likes0 downloads0 reach0 impact
1038 instances - 13 features - 0 classes - 7 missing values
Source: 1. Olcay KURSUN, PhD., Istanbul University, Department of Computer Engineering, 34320, Istanbul, Turkey Phone: +90 (212) 473 7070 - 17827 Email: okursun '@' istanbul.edu.tr 2. Betul ERDOGDU…
0 runs0 likes0 downloads0 reach0 impact
1039 instances - 29 features - classes - 0 missing values
## **Meta-Album Boats Dataset (Mini)** The original version of the Meta-Album boats dataset is called MARVEL dataset (https://github.com/avaapm/marveldataset2016). It has more than 138 000 images of…
0 runs0 likes0 downloads0 reach1 impact
1040 instances - 3 features - 26 classes - 1040 missing values
Pizza cutter 3
188 runs0 likes7 downloads7 reach14 impact
1043 instances - 38 features - 2 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 101464, and it has 1044 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1044 instances - 1026 features - 0 classes - 0 missing values
Pokemons are something which fascinated me every single time. Who would believe that a 6 year old kid used to be late to school almost everyday because of watching those extra minutes of the Pokemon…
0 runs0 likes0 downloads0 reach0 impact
1045 instances - 13 features - 0 classes - 492 missing values
This file contains the Economic data information of USA from 01/04/1980 to 02/04/2000 on a weekly basis. From given features, the goal is to predict 1 Month CD Rate. 1. 1Y-CMaturityRate real [77.055,…
0 runs0 likes0 downloads0 reach0 impact
1049 instances - 16 features - 0 classes - 0 missing values
No data.
416 runs0 likes0 downloads0 reach0 impact
1050 instances - 3239 features - 10 classes - 0 missing values
Subsampling of the dataset qsar-biodeg (1494) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
1055 instances - 42 features - 2 classes - 0 missing values
Subsampling of the dataset qsar-biodeg (1494) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
1055 instances - 42 features - 2 classes - 0 missing values
Subsampling of the dataset qsar-biodeg (1494) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
1055 instances - 42 features - 2 classes - 0 missing values
Subsampling of the dataset qsar-biodeg (1494) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
1055 instances - 42 features - 2 classes - 0 missing values
Subsampling of the dataset qsar-biodeg (1494) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
1055 instances - 42 features - 2 classes - 0 missing values
The QSAR biodegradation dataset was built in the Milano Chemometrics and QSAR Research Group. The research leading to these results has received funding from the European Communitys Seventh Framework…
0 runs0 likes0 downloads0 reach0 impact
1055 instances - 41 features - 2 classes - 0 missing values
QSAR biodegradation Data Set * Abstract: Data set containing values for 41 attributes (molecular descriptors) used to classify 1055 chemicals into 2 classes (ready and not ready biodegradable). *…
267861 runs1 likes25 downloads26 reach30 impact
1055 instances - 42 features - 2 classes - 0 missing values
Electrical-Maintenance data set This problem consists of four input variables and the available data set is comprised of a representative number of well distributed examples. In this case, the…
0 runs0 likes0 downloads0 reach0 impact
1056 instances - 5 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30017, and it has 1058 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1058 instances - 1026 features - 0 classes - 0 missing values
Predicting the Geographical Origin of Music, ICDM, 2014 Abstract: Instances in this dataset contain audio features extracted from 1059 wave files. The task associated with the data is to predict the…
0 runs0 likes0 downloads0 reach0 impact
1059 instances - 118 features - 0 classes - 0 missing values
Instances in this dataset contain audio features extracted from 1059 wave files. The task associated with the data is to predict the geographical origin of music. The dataset was built from a personal…
0 runs0 likes0 downloads0 reach0 impact
1059 instances - 117 features - 0 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Water Quality dataset (Dzeroski et al. 2000) has 14 target attributes that refer to the…
0 runs0 likes0 downloads0 reach0 impact
1060 instances - 30 features - classes - 0 missing values
This dataset contains information about used motorcycles This data can be used for a lot of purposes such as price prediction to exemplify the use of linear regression in Machine Learning. The columns…
0 runs0 likes0 downloads0 reach0 impact
1061 instances - 7 features - classes - 435 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Solar Flare dataset (Lichman 2013) has 3 target variables that correspond to the number of…
0 runs0 likes0 downloads0 reach0 impact
1066 instances - 13 features - classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : The Solar Flare dataset (Lichman 2013) has 3 target variables that correspond to the number of…
0 runs0 likes0 downloads0 reach0 impact
1066 instances - 13 features - classes - 0 missing values
flare-pmlb
32 runs0 likes0 downloads0 reach0 impact
1066 instances - 11 features - 2 classes - 0 missing values
Relevant Information: -- The database contains 3 potential classes, one for the number of times a certain type of solar flare occured in a 24 hour period. -- Each instance represents captured features…
31 runs0 likes0 downloads0 reach0 impact
1066 instances - 13 features - 6 classes - 0 missing values
Predict the number of common solar flares. The database contains 3 classes, one for the number of times a certain type of solar flare occurred in a 24 hour period. Each instance represents captured…
0 runs0 likes0 downloads0 reach0 impact
1066 instances - 11 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10696, and it has 1070 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1070 instances - 1026 features - 0 classes - 0 missing values
One of the primary challenges in identifying the risks of the Burst Header Packet (BHP) flood attacks in Optical Burst Switching networks (OBS) is the scarcity of reliable historical data. ###…
0 runs0 likes0 downloads0 reach0 impact
1075 instances - 22 features - classes - 15 missing values
bla
0 runs0 likes0 downloads0 reach0 impact
1075 instances - 10 features - classes - 0 missing values
pie chart 3
103 runs0 likes6 downloads6 reach13 impact
1077 instances - 38 features - 2 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: E2 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
105 runs0 likes0 downloads0 reach0 impact
1080 instances - 4 features - 5 classes - 0 missing values
Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative learning. The…
9545 runs0 likes0 downloads0 reach21 impact
1080 instances - 82 features - 8 classes - 1396 missing values
## **Meta-Album Plant Doc Dataset (Mini)** The PlantDoc dataset(https://github.com/pratikkayal/PlantDoc-Dataset) is made up of images of leaves of healthy and unhealthy plants. The images were…
0 runs0 likes0 downloads0 reach1 impact
1080 instances - 3 features - 27 classes - 1080 missing values
Subsampling of the dataset cnae-9 (1468) with seed=0 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
1080 instances - 101 features - 9 classes - 0 missing values
Subsampling of the dataset cnae-9 (1468) with seed=1 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
1080 instances - 101 features - 9 classes - 0 missing values
Subsampling of the dataset cnae-9 (1468) with seed=2 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
1080 instances - 101 features - 9 classes - 0 missing values
Subsampling of the dataset cnae-9 (1468) with seed=3 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
1080 instances - 101 features - 9 classes - 0 missing values
Subsampling of the dataset cnae-9 (1468) with seed=4 args.nrows=2000 args.ncols=100 args.nclasses=10 args.no_stratify=True Generated with the following source code: ```python def subsample( self,…
0 runs0 likes0 downloads0 reach0 impact
1080 instances - 101 features - 9 classes - 0 missing values
Context Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative…
0 runs0 likes0 downloads0 reach0 impact
1080 instances - 81 features - classes - 1396 missing values
### Description This is a data set containing 1080 documents of free text business descriptions of Brazilian companies categorized into a subset of 9 categories. ### Source ``` Patrick Marques…
34579 runs0 likes17 downloads17 reach58 impact
1080 instances - 857 features - 9 classes - 0 missing values
aasdasda
0 runs0 likes0 downloads0 reach0 impact
1085 instances - 1 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10315, and it has 1086 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1086 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11755, and it has 1089 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1089 instances - 1026 features - 0 classes - 0 missing values
source: http://plato.asu.edu/ftp/solvable.html authors: Rolf-David Bergdoll PAR10 performances of modern solvers on the solvable instances of MIPLIB2010. http://miplib.zib.de/ The algorithm runtime…
0 runs0 likes0 downloads0 reach0 impact
1090 instances - 148 features - 0 classes - 0 missing values
source: http://plato.asu.edu/ftp/solvable.html authors: Rolf-David Bergdoll PAR10 performances of modern solvers on the solvable instances of MIPLIB2010. http://miplib.zib.de/ The algorithm runtime…
0 runs0 likes0 downloads0 reach0 impact
1090 instances - 145 features - 0 classes - 0 missing values
No data.
0 runs0 likes0 downloads0 reach0 impact
1090 instances - 147 features - 0 classes - 0 missing values
source: http://plato.asu.edu/ftp/solvable.html authors: Rolf-David Bergdoll PAR10 performances of modern solvers on the solvable instances of MIPLIB2010. http://miplib.zib.de/ The algorithm runtime…
0 runs0 likes0 downloads0 reach0 impact
1090 instances - 145 features - 0 classes - 0 missing values
Why 2017 to 2020? The BitCoin data is Limited to these 3 years because as everyone Knows: Dec 2017 was the time when BitCoin Prices Skyrocketed! Hence, this duration is Perfect for Predicting future…
0 runs0 likes0 downloads0 reach0 impact
1097 instances - 7 features - classes - 0 missing values
Context A dataset I used to classify tweets about my company. I took tweets and I classified them manually as positive, negative or neutral. Content There are 4 columns : Id : the tweed id, unique.…
0 runs0 likes0 downloads0 reach0 impact
1097 instances - 4 features - classes - 0 missing values
* Dataset Title: AutoUniv Dataset data problem: autoUniv-au7-300-drift-au7-cpd1-800 * Abstract: AutoUniv is an advanced data generator for classifications tasks. The aim is to reflect the nuances and…
7130 runs0 likes0 downloads0 reach0 impact
1100 instances - 13 features - 5 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10548, and it has 1100 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1100 instances - 1026 features - 0 classes - 0 missing values
This dataset has been scrapped off Goodreads to obtain land information about the best books of the 19th Century. Feature Description Book_Name the title of the book Author_Name the author(s) of the…
0 runs0 likes0 downloads0 reach0 impact
1101 instances - 5 features - classes - 4 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30043, and it has 1103 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1103 instances - 1026 features - 0 classes - 0 missing values
Context Emotions Detection Is an Interesting Blend of Psychology and Technology. -This Technology Helps to Build a Companion Robots,This Robots Can be Friendly and Have The Ability to Recognize Users…
0 runs0 likes0 downloads0 reach0 impact
1104 instances - 8 features - classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11242, and it has 1107 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1107 instances - 1026 features - 0 classes - 0 missing values
One of the NASA Metrics Data Program defect data sets. Data from flight software for earth orbiting satellite. Data comes from McCabe and Halstead features extractors of source code. These features…
149998 runs0 likes27 downloads27 reach28 impact
1109 instances - 22 features - 2 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: E5 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
105 runs0 likes0 downloads0 reach0 impact
1112 instances - 4 features - 5 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 30036, and it has 1116 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1116 instances - 1026 features - 0 classes - 0 missing values
Context I have collected the Toronto Apartment Rental prices from various sources in local websites. Content There are 7 columns in the dataset. Bedroom - How many bedrooms available Bathroom - How…
0 runs0 likes0 downloads0 reach0 impact
1124 instances - 7 features - 188 classes - 0 missing values
parity5_plus_5-pmlb
31 runs0 likes0 downloads0 reach22 impact
1124 instances - 11 features - 2 classes - 0 missing values
Context Dogecoin is making news as well as a little profit these days although it may seem like its new in the market but it has been around quite a while now. I have tried to collect historical data…
0 runs0 likes0 downloads0 reach0 impact
1131 instances - 7 features - classes - 24 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 246, and it has 1135 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
1135 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17080, and it has 1135 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1135 instances - 1026 features - 0 classes - 0 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : This is a pre-processed version of the dataset used in Kaggles See Click Predict Fix competition…
0 runs0 likes0 downloads0 reach0 impact
1137 instances - 26 features - classes - 9255 missing values
Multivariate regression data set from: https://link.springer.com/article/10.1007%2Fs10994-016-5546-z : This is a pre-processed version of the dataset used in Kaggles See Click Predict Fix competition…
0 runs0 likes0 downloads0 reach0 impact
1137 instances - 26 features - classes - 9255 missing values
wine-quality data for testing in automl-benchmark
0 runs0 likes0 downloads0 reach0 impact
1143 instances - 13 features - classes - 0 missing values
DOROTHEA is a drug discovery dataset. Chemical compounds represented by structural molecular features must be classified as active (binding to thrombin) or inactive. This is one of 5 datasets of the…
0 runs0 likes0 downloads0 reach0 impact
1150 instances - 100001 features - 2 classes - 0 missing values
Israeli lottery
0 runs0 likes0 downloads0 reach0 impact
1153 instances - 11 features - classes - 0 missing values
17x17x2x2 tables of counts in GLIM-ready format used for the analyses in Biblarz, Timothy J., and Adrian E. Raftery. 1993. "The Effects of Family Disruption on Social Mobility." American Sociological…
6 runs0 likes0 downloads0 reach0 impact
1156 instances - 6 features - 0 classes - 0 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
1899 runs0 likes0 downloads0 reach0 impact
1156 instances - 6 features - 2 classes - 0 missing values
This dataset described social mobility, i.e. how the sons' occupations are related to their fathers' jobs. An instance represent the number of sons that have a certain job A given the father has the…
0 runs0 likes0 downloads0 reach0 impact
1156 instances - 6 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 100080, and it has 1157 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1157 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 17085, and it has 1159 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1159 instances - 1026 features - 0 classes - 0 missing values
## **Meta-Album MPII Human Pose Dataset Dataset (Mini)** The MPII Human Pose dataset (http://human-pose.mpi-inf.mpg.de/#download) is a state of the art benchmark for evaluation of articulated human…
0 runs0 likes0 downloads0 reach1 impact
1160 instances - 3 features - 29 classes - 1160 missing values
The AAUP dataset for the ASA Statistical Graphics Section's 1995 Data Analysis Exposition contains information on faculty salaries for 1161 American colleges and universities. The data may be obtained…
32 runs0 likes0 downloads0 reach0 impact
1161 instances - 15 features - 4 classes - 256 missing values
Binarized version of the original data set (see version 1). It converts the numeric target feature to a two-class nominal target feature by computing the mean and classifying all instances with a…
101 runs0 likes0 downloads0 reach0 impact
1161 instances - 16 features - 2 classes - 256 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10209, and it has 1171 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1171 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 247, and it has 1171 rows and 1026 features (including…
1 runs0 likes0 downloads0 reach0 impact
1171 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11290, and it has 1171 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1171 instances - 1026 features - 0 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11727, and it has 1174 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1174 instances - 1026 features - 0 classes - 0 missing values
Context The aim of this dataset is to offer in a relatively small number of columns (30) data to compare the performance of some football players, or to compare the efficiency of strikers in-between…
0 runs0 likes0 downloads0 reach0 impact
1177 instances - 32 features - classes - 3867 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 11638, and it has 1180 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1180 instances - 1026 features - 0 classes - 0 missing values
* Dataset Title: Volcanoes on Venus - JARtool experiment Data Set Experiment: E1 * Source: Michael C. Burl MS 126-347, JPL 4800 Oak Grove Drive Pasadena, CA 91109 (818) 393-5345 Michael.C.Burl '@'…
105 runs0 likes0 downloads0 reach0 impact
1183 instances - 4 features - 5 classes - 0 missing values
This dataset contains QSAR data (from ChEMBL version 17) showing activity values (unit is pseudo-pCI50) of several compounds on drug target TID: 10003, and it has 1187 rows and 1026 features…
1 runs0 likes0 downloads0 reach0 impact
1187 instances - 1026 features - 0 classes - 0 missing values
Water stress dataset for Indian variety of wheat crop: The data consist of a file system-based data of Raj 3765 variety of wheat. There are twenty-four chlorophyll fluorescence images captured every…
0 runs0 likes0 downloads0 reach0 impact
1188 instances - 23 features - 0 classes - 0 missing values