Data
Filter results by:
Context It's the preprocessed train data from Quora Insincere Questions competition 2018 The original train data is preprocessed to remove stop words, numbers, punctuations, common words and converted…
0 runs0 likes0 downloads0 reach0 impact
1306122 instances - 4 features - classes - 1 missing values
Context With the growing dependency that our society has on the Internet, the amount of data that goes through networks keeps increasing. Network monitoring and analysis of consumption behavior…
0 runs0 likes0 downloads0 reach0 impact
1249 instances - 113 features - classes - 0 missing values
Palmer Penguins Dataset The goal of palmerpenguins is to provide a great dataset for data exploration visualization, as an alternative to iris. About the data Data were collected and made available by…
0 runs0 likes0 downloads0 reach0 impact
344 instances - 7 features - classes - 19 missing values
Context Started this for my final year project improved on it during quarantine. Content A mix of used and new car reviews from the year 2000 to 2019 of various brands from edmunds.com. I did not…
0 runs0 likes0 downloads0 reach0 impact
299045 instances - 8 features - classes - 171 missing values
CryptocurrenciesCryptocurrencies are fast becoming rivals to traditional currency across the world. The digital currencies are available to purchase in many different places, making it accessible to…
0 runs0 likes0 downloads0 reach0 impact
632218 instances - 9 features - classes - 69712 missing values
Context This dataset is meant to complement the Austin Bikesharing Dataset. Content Contains the: Date (YYYY-MM-DD) TempHighF (High temperature, in Fahrenheit) TempAvgF (Average temperature, in…
0 runs0 likes0 downloads0 reach0 impact
1319 instances - 21 features - classes - 0 missing values
Cryptocurrencies Cryptocurrencies are fast becoming rivals to traditional currency across the world. The digital currencies are available to purchase in many different places, making it accessible to…
0 runs0 likes0 downloads0 reach0 impact
632218 instances - 9 features - classes - 69712 missing values
Context The consumer credit department of a bank wants to automate the decisionmaking process for approval of home equity lines of credit. To do this, they will follow the recommendations of the Equal…
0 runs0 likes0 downloads0 reach0 impact
5960 instances - 13 features - classes - 5271 missing values
The dataset was created by Angeliki Xifara (angxifara '@' gmail.com, Civil/Structural Engineer) and was processed by Athanasios Tsanas (tsanasthanasis '@' gmail.com, Oxford Centre for Industrial and…
0 runs0 likes0 downloads0 reach0 impact
768 instances - 10 features - classes - 0 missing values
Context Chocolate is one of the most popular candies in the world. Each year, residents of the United States collectively eat more than 2.8 billions pounds. However, not all chocolate bars are created…
0 runs0 likes0 downloads0 reach0 impact
1795 instances - 9 features - classes - 962 missing values
Context Chocolate is one of the most popular candies in the world. Each year, residents of the United States collectively eat more than 2.8 billions pounds. However, not all chocolate bars are created…
0 runs0 likes0 downloads0 reach0 impact
1795 instances - 9 features - classes - 962 missing values
Context This dataset was collected by Neha Prerna Tigga and Dr. Shruti Garg of the Department of Computer Science and Engineering, BIT Mesra, Ranchi-835215 for research, non-commercial purposes only.…
0 runs0 likes0 downloads0 reach0 impact
952 instances - 18 features - classes - 48 missing values
Context Projects are a great way to learn data science. So I started my own. The numerous housing data sets on Kaggle were the inspiration for this data set. Predicting housing prices is a simple yet…
0 runs0 likes0 downloads0 reach0 impact
10552 instances - 26 features - classes - 49282 missing values
Mammography is the most effective method for breast cancer screening available today. However, the low positive predictive value of breast biopsy resulting from mammogram interpretation leads to…
0 runs0 likes0 downloads0 reach0 impact
830 instances - 6 features - classes - 0 missing values
Context A corporate credit rating expresses the ability of a firm to repay its debt to creditors. Credit rating agencies are the entities responsible to make the assessment and give a verdict. When a…
0 runs0 likes0 downloads0 reach0 impact
2029 instances - 31 features - classes - 0 missing values
Context While brainstorming ideas for a statistics project for a course last semester, the idea of utilizing data about microbreweries came up. Unfortunately after some exploration and thought, we…
0 runs0 likes0 downloads0 reach0 impact
2407 instances - 6 features - classes - 8 missing values
ContextThisdatasetismeanttocomplementtheAustinBikesharingDatasetContentContainstheDateYYYYMMDDTempHighFHightemperatureinFahrenheitTempAvgFAveragetemperatureinFahrenheitTempLowFLowtemperatureinFahrenheitDewPointHighFHighdewpointinFahrenheitDewPointAvgFAveragedewpointinFahrenheitDewPointLowFLowdewpointinFahrenheitHumidityHighPercentHighhumidityasapercentageHumidityAvgPercentAveragehumidityasapercentageHumidityLowPercentLowhumidityasapercentageSeaLevelPressureHighInchesHighsealevelpressureininchesSeaLevelPressureAvgInchesAveragesealevelpressureininchesSeaLevelPressureLowInchesLowsealevelpressureininchesVisibilityHighMilesHighvisibilityinmilesVisibilityAvgMilesAveragevisibilityinmilesVisibilityLowMilesLowvisibilityinmilesWindHighMPHHighwindspeedinmilesperhourWindAvgMPHAveragewindspeedinmilesperhourWindGustMPHHighestwindspeedgustinmilesperhourPrecipitationSumInchesTotalprecipitationininchesTifTraceEventsAdverseweathereventsifNoneThisdatasetcontainsdataforeverydatefrom20131221to20170731AcknowledgementsThisdatasetwasobtainedfromWeatherUndergroundcomattheAustinKATTstationInspirationCanweusethisdatasettoexplainsomeofthevariationintheAustinBikesharingDataset…
0 runs0 likes0 downloads0 reach0 impact
1319 instances - 21 features - classes - 0 missing values
CryptocurrenciesCryptocurrenciesarefastbecomingrivalstotraditionalcurrencyacrosstheworldThedigitalcurrenciesareavailabletopurchaseinmanydifferentplacesmakingitaccessibletoeveryoneandwithretailersacceptingvariouscryptocurrenciesitcouldbeasignthatmoneyasweknowitisabouttogothroughamajorchangeInadditiontheblockchaintechnologyonwhichmanycryptocurrenciesarebasedwithitsrevolutionarydistributeddigitalbackbonehasmanyotherpromisingapplicationsImplementationsofsecuredecentralizedsystemscanaidusinconqueringorganizationalissuesoftrustandsecuritythathaveplaguedoursocietythroughouttheagesIneffectwecanfundamentallydisruptindustriescoretoeconomiesbusinessesandsocialstructureseliminatinginefficiencyandhumanerrorContentThedatasetcontainsallhistoricaldailypricesopenhighlowcloseforallcryptocurrencieslistedonCoinMarketCapAcknowledgementsEveryCryptocurrencyDailyMarketPriceIinitiallydevelopedkernelsforthisdatasetbeforemakingmyownscraperanddatasetsothatIcouldkeepitregularlyupdatedCoinMarketCapForthedata…
0 runs0 likes0 downloads0 reach0 impact
632218 instances - 9 features - classes - 69712 missing values
ContextTheconsumercreditdepartmentofabankwantstoautomatethedecisionmakingprocessforapprovalofhomeequitylinesofcreditTodothistheywillfollowtherecommendationsoftheEqualCreditOpportunityActtocreateanempiricallyderivedandstatisticallysoundcreditscoringmodelThemodelwillbebasedondatacollectedfromrecentapplicantsgrantedcreditthroughthecurrentprocessofloanunderwritingThemodelwillbebuiltfrompredictivemodelingtoolsbutthecreatedmodelmustbesufficientlyinterpretabletoprovideareasonforanyadverseactionsrejectionsContentTheHomeEquitydatasetHMEQcontainsbaselineandloanperformanceinformationfor5960recenthomeequityloansThetargetBADisabinaryvariableindicatingwhetheranapplicanteventuallydefaultedorwasseriouslydelinquentThisadverseoutcomeoccurredin1189cases20Foreachapplicant12inputvariableswererecordedAcknowledgementsInspirationWhatifyoucanpredictclientswhodefaultontheirloans…
0 runs0 likes0 downloads0 reach0 impact
5960 instances - 13 features - classes - 5271 missing values
Context Traffic data collected from the several Wavetronix radar sensors deployed by the City of Austin. Dataset is augmented with geo coordinates from sensor location dataset. Source:…
0 runs0 likes0 downloads0 reach0 impact
4603861 instances - 12 features - classes - 0 missing values
ContextThisdatasetwascollectedbyNehaPrernaTiggaandDrShrutiGargoftheDepartmentofComputerScienceandEngineeringBITMesraRanchi835215forresearchnoncommercialpurposesonlyAnarticleisalsopublishedimplementingthisdatasetFormoreinformationandcitationofthisdatasetpleasereferTiggaNPGargS2020PredictionofType2DiabetesusingMachineLearningClassificationMethodsProcediaComputerScience167706716DOIhttpsdoiorg101016jprocs202003336ContentThereisatotalof952instanceswith17independentpredictorvariablesandonebinarytargetordependentvariableDiabetesAcknowledgementsWewouldliketothankalltheparticipantswhocontributedtowardsthebuildingofthisdatasetInspirationTobuildamachinelearningalgorithmtopredictifapersonhasdiabetesornot…
0 runs0 likes0 downloads0 reach0 impact
952 instances - 18 features - classes - 48 missing values
ContextProjectsareagreatwaytolearndatascienceSoIstartedmyownThenumeroushousingdatasetsonKaggleweretheinspirationforthisdatasetPredictinghousingpricesisasimpleyetinsightfulregressionproblemUnderstandingdatatakestimeandthemorepeopleanalyzeitthefasterthesecretscanbeuncoveredIacquiredthedatabyscrapingImmoScout24amarketplaceforGermanrealestate…
0 runs0 likes0 downloads0 reach0 impact
10552 instances - 26 features - classes - 49282 missing values
ContextAcorporatecreditratingexpressestheabilityofafirmtorepayitsdebttocreditorsCreditratingagenciesaretheentitiesresponsibletomaketheassessmentandgiveaverdictWhenabigcorporationfromtheUSoranywhereintheworldwantstoissueanewbondithiresacreditagencytomakeanassessmentsothatinvestorscanknowhowtrustworthyisthecompanyTheassessmentisbasedespeciallyinthefinancialsindicatorsthatcomefromthebalancesheetSomeofthemostimportantagenciesintheworldareMoodysFitchandStandardandPoorsContentAlistof2029creditratingsissuedbymajoragenciessuchasStandardandPoorstobigUSfirmstradedonNYSEorNasdaqfrom2010to2016Thereare30featuresforeverycompanyofwhich25arefinancialindicatorsTheycanbedividedinLiquidityMeasurementRatioscurrentRatioquickRatiocashRatiodaysOfSalesOutstandingProfitabilityIndicatorRatiosgrossProfitMarginoperatingProfitMarginpretaxProfitMarginnetProfitMargineffectiveTaxRatereturnOnAssetsreturnOnEquityreturnOnCapitalEmployedDebtRatiosdebtRatiodebtEquityRatioOperatingPerformanceRatiosassetTurnoverCashFlowIndicatorRatiosoperatingCashFlowPerSharefreeCashFlowPerSharecashPerShareoperatingCashFlowSalesRatiofreeCashFlowOperatingCashFlowRatioFormoreinformationaboutfinancialindicatorsvisithttpsfinancialmodelingprepcommarketindexesmajormarketsTheadditionalfeaturesareNameSymbolfortradingRatingAgencyNameDateandSectorThedatasetisunbalancedhereisthefrequencyofratingsAAA7AA89A398BBB671BB490B302CCC64CC5C2D1AcknowledgementsThisdatasetwaspossiblethankstofinancialmodelingprepandopendatasoftthesourcesofthedataToseehowthedatawasintegratedandreshapedcheckhereInspirationIsitpossibletoforecasttheratinganagencywillgivetoacompanybasedonitsfinancials…
0 runs0 likes0 downloads0 reach0 impact
2029 instances - 31 features - classes - 0 missing values
MINI - Birds dataset
0 runs0 likes0 downloads0 reach0 impact
500 instances - 3 features - 20 classes - 0 missing values
mini insect example dataset # 1
0 runs0 likes0 downloads0 reach0 impact
12 instances - 4 features - 4 classes - 0 missing values
MINI - Birds dataset
0 runs0 likes0 downloads0 reach0 impact
500 instances - 3 features - 20 classes - 0 missing values
mini insect example dataset # 1
0 runs0 likes0 downloads0 reach0 impact
12 instances - 4 features - 4 classes - 0 missing values
ContextThis dataset is meant to complement the Austin Bikesharing Dataset.ContentContains the Date YYYYMMDD TempHighF High temperature, in Fahrenheit TempAvgF Average temperature, in Fahrenheit…
0 runs0 likes0 downloads0 reach0 impact
1319 instances - 21 features - classes - 0 missing values
This dataset contains the daily currency exchange rates as reported to the International Monetary Fund by the issuing central bank. Included are 51 currencies over the period from 01-01-1995 to…
0 runs0 likes0 downloads0 reach0 impact
5978 instances - 52 features - classes - 61189 missing values
Context Japanese animation, which is known as anime, has become internationally widespread nowadays. This dataset provides data on anime taken from Anime News Network. Content This dataset consists of…
0 runs0 likes0 downloads0 reach0 impact
1563 instances - 20 features - classes - 0 missing values
Context This dataset has an exhaustive list of player statistics for each season from 2013 - 2020 Content Each row is associated with a player and a season. Eg. You will have 7 rows for Lionel Messi:…
0 runs0 likes0 downloads0 reach0 impact
5733 instances - 109 features - classes - 187897 missing values
San Francisco International Airport Report on Monthly Passenger Traffic Statistics by Airline. Airport data is seasonal in nature, therefore any comparative analyses should be done on a…
0 runs0 likes0 downloads0 reach0 impact
15007 instances - 16 features - classes - 108 missing values
Context This data set could be used without permission for purposes of enhancing machine learning and data science. It can easily be adapted for classification, regression and clustering. At a…
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 16 features - classes - 1080 missing values
All data is scraped from ESPN's Team Stats page for each game. Seasons include all 256 regular season games plus 11 playoff games, with the exception of 3 games that are missing from ESPN's site:…
0 runs0 likes0 downloads0 reach0 impact
5357 instances - 39 features - classes - 0 missing values
The New York City Bike Share enables quick, easy, and affordable bike trips around the New York city boroughs. They make regular open data releases (this dataset is a transformed version of the data…
0 runs0 likes0 downloads0 reach0 impact
735502 instances - 17 features - classes - 0 missing values
Introduction This dataset is part of my PhD research on malware detection and classification using Deep Learning. It contains static analysis data (PE Section Headers of the .text, .code and CODE…
0 runs0 likes0 downloads0 reach0 impact
43293 instances - 5 features - classes - 0 missing values
This a blend dataset that contains historic Swedish interest rates from 1908-2001 Source/Klla: Sveriges riksbank and Swedish inflation rate 1908-2001 fetched from Sweden's statistic central bureau…
0 runs0 likes0 downloads0 reach0 impact
109 instances - 4 features - classes - 49 missing values
Description Amazon.com, Inc., is an American multinational technology company based in Seattle that focuses on e-commerce, cloud computing, digital streaming, and artificial intelligence. It is…
0 runs0 likes0 downloads0 reach0 impact
5842 instances - 7 features - classes - 0 missing values
Traffic violations followed the invention of the automobile: the first traffic ticket in the United States was allegedly given to a New York City cab driver on May 20, 1899, for going at the breakneck…
0 runs0 likes0 downloads0 reach0 impact
1018634 instances - 34 features - classes - 328559 missing values
Context The Philippine Statistics Authority (PSA) spearheads the conduct of the Family Income and Expenditure Survey (FIES) nationwide. The survey, which is undertaken every three (3) years, is aimed…
0 runs0 likes0 downloads0 reach0 impact
41544 instances - 60 features - classes - 15072 missing values
Context Expression levels of 77 proteins measured in the cerebral cortex of 8 classes of control and Down syndrome mice exposed to context fear conditioning, a task used to assess associative…
0 runs0 likes0 downloads0 reach0 impact
1080 instances - 81 features - classes - 1396 missing values
Context Melbourne real estate is BOOMING. Can you find the insight or predict the next big trend to become a real estate mogul or even harder, to snap up a reasonably priced 2-bedroom unit? Content…
0 runs0 likes0 downloads0 reach0 impact
13580 instances - 21 features - classes - 13256 missing values
Context The objective of this dataset is to create a chess engine through machine learning. In this first part we will first predict the pieces to be moved depending on the position of the chessboard…
0 runs0 likes0 downloads0 reach0 impact
2632753 instances - 66 features - classes - 0 missing values
Context I am currently writing a seminar paper about content summarization in Twitter networks. One of the Frameworks I read about uses a peak detection algorithm to cluster tweets by topic-aware peak…
0 runs0 likes0 downloads0 reach0 impact
30000 instances - 52 features - classes - 0 missing values
This dataset is based on dipam7/student-grade-prediction Dataset General information This data approach student achievement in secondary education of two Portuguese schools. The data attributes…
0 runs0 likes0 downloads0 reach0 impact
395 instances - 30 features - 0 classes - 0 missing values
Why 2017 to 2020? The BitCoin data is Limited to these 3 years because as everyone Knows: Dec 2017 was the time when BitCoin Prices Skyrocketed! Hence, this duration is Perfect for Predicting future…
0 runs0 likes0 downloads0 reach0 impact
1097 instances - 7 features - classes - 0 missing values
Dataset Information This dataset contains information on default payments, demographic factors, credit data, history of payment, and bill statements of credit card clients in Taiwan from April 2005 to…
0 runs0 likes0 downloads0 reach0 impact
30000 instances - 24 features - classes - 0 missing values
This data was extracted from the 1994 Census bureau database by Ronny Kohavi and Barry Becker (Data Mining and Visualization, Silicon Graphics). A set of reasonably clean records was extracted using…
0 runs0 likes0 downloads0 reach0 impact
32561 instances - 15 features - classes - 4262 missing values
Voice Gender Gender Recognition by Voice and Speech Analysis This database was created to identify a voice as male or female, based upon acoustic properties of the voice and speech. The dataset…
0 runs0 likes0 downloads0 reach0 impact
3168 instances - 21 features - classes - 0 missing values
Detailed data description of Credit Risk dataset: Feature Name Description person_age Age person_income Annual Income personhomeownership Home ownership personemplength Employment length (in years)…
0 runs0 likes0 downloads0 reach0 impact
32581 instances - 12 features - classes - 4011 missing values
About This dataset is based on https://www.kaggle.com/manjeetsingh/retaildataset Historical sales data covers the period from 2010-02-05 to 2012-11-01. Sales and Features data was merged, Weekly sales…
0 runs0 likes0 downloads0 reach0 impact
149 instances - 12 features - classes - 0 missing values
Data Set Name: Autistic Spectrum Disorder Screening Data for Adult Autistic Spectrum Disorder (ASD) is a neurodevelopment condition associated with significant healthcare costs, and early diagnosis…
0 runs0 likes0 downloads0 reach0 impact
704 instances - 21 features - classes - 192 missing values
Students' Academic Performance Dataset (xAPI-Edu-Data) Data Set Characteristics: Multivariate Number of Instances: 480 Area: E-learning, Education, Predictive models, Educational Data Mining Attribute…
0 runs0 likes0 downloads0 reach0 impact
480 instances - 17 features - classes - 0 missing values
Content This dataset comprises of various house listings in London and neighbouring region. It also encompasses the parameters listed below, the definitions of which are quite self-explanatory.…
0 runs0 likes0 downloads0 reach0 impact
3480 instances - 11 features - classes - 962 missing values
Context Real Estate data from the Canadian capital city of Ottawa for you to test your data analytics skill. This is my first dataset in Kaggle. Please let me know of anything that needs to be…
0 runs0 likes0 downloads0 reach0 impact
1255 instances - 13 features - classes - 2598 missing values
Context Hi, The original Dataset wad published by Frankcc in the following link: Link Kaggle The dataset is involved into the analysis of depression. The data was consists as a study about the life…
0 runs0 likes0 downloads0 reach0 impact
1429 instances - 23 features - classes - 20 missing values
Context A person makes a doctor appointment, receives all the instructions and no-show. Who to blame? If this help you studying or working, please dont forget to upvote :). Reference to Joni Hoppen…
0 runs0 likes0 downloads0 reach0 impact
110527 instances - 13 features - 2 classes - 0 missing values
Data Description Dataset is from one of the leading travel site containing hotel reviews provided by customers. Variable Description User_ID unique ID of the customer Description description of the…
0 runs0 likes0 downloads0 reach0 impact
38932 instances - 4 features - classes - 0 missing values
Context Dataset of the original 151 Pokemon with stats only from Generation 1. Data was scraped from https://serebii.net/. Content Number: Pokedex Index Number Name: Name of the Pokemon Types: The…
0 runs0 likes0 downloads0 reach0 impact
151 instances - 34 features - classes - 0 missing values
Context I have collected the Toronto Apartment Rental prices from various sources in local websites. Content There are 7 columns in the dataset. Bedroom - How many bedrooms available Bathroom - How…
0 runs0 likes0 downloads0 reach0 impact
1124 instances - 7 features - 188 classes - 0 missing values
Context Train your project by getting weather data from Istanbul. Content You can find daily weather, rainfall, sunrise time, sunset time, moonrise time, sunset time, average wind speed, average…
0 runs0 likes0 downloads0 reach0 impact
3896 instances - 12 features - classes - 263 missing values
Context METAR is a format for reporting weather information. A METAR weather report is predominantly used by pilots in fulfillment of a part of a pre-flight weather briefing, and by meteorologists,…
0 runs0 likes0 downloads0 reach0 impact
8787 instances - 13 features - classes - 6960 missing values
Context Tumor microRNA expression profiling identifies circulating microRNAs for earlier breast cancer detection. Due to microRNA role in tumorigenesis and remarkable stability in body fluids,…
0 runs0 likes0 downloads0 reach0 impact
133 instances - 1928 features - classes - 0 missing values
Context Tumor microRNA expression profiling identifies circulating microRNAs for earlier breast cancer detection. Due to microRNA role in tumorigenesis and remarkable stability in body fluids,…
0 runs0 likes0 downloads0 reach0 impact
133 instances - 1928 features - classes - 0 missing values
Naturally, I am myself an NBA fan and I don't prefer when I have data that has missing information. But not missing as like missing instance in columns, I mean the whole columns that I could use in my…
0 runs0 likes0 downloads0 reach0 impact
54514 instances - 142 features - classes - 0 missing values
Context I wanted a highly imbalanced dataset to share with others. It has the perfect one for us. Imbalanced data typically refers to a classification problem where the number of observations per…
0 runs0 likes0 downloads0 reach0 impact
9578 instances - 14 features - classes - 0 missing values
Context This works focuses upon creating a data set on Pandas Q/A over StackOverflow. Presently, there are more than 90k+ questions available on StackOverflow which have been asked under Pandas…
0 runs0 likes0 downloads0 reach0 impact
87241 instances - 16 features - classes - 472864 missing values
Context I scraped all of the currently available Urban Dictionary pages (611) on 3/26/17 Content word - the slang term added to urban dictionary definition - the definition of said term author - the…
0 runs0 likes0 downloads0 reach0 impact
4272 instances - 6 features - classes - 0 missing values
Context Includes data on confirmed cases, deaths, hospitalizations, and testing, as well as other variables of potential interest. Content As of 26 January 2021, the columns are: isocode, continent,…
0 runs0 likes0 downloads0 reach0 impact
63381 instances - 59 features - classes - 1508423 missing values
Context This is home value data for the hot Nashville market. Content There are 56,000+ rows altogether. However, I'm missing home detail data for about half. So if anyone wants to track that down…
0 runs0 likes0 downloads0 reach0 impact
56636 instances - 31 features - classes - 648773 missing values
Context Hourly weather data for New York City. Extracted from online web sources. The following data set is cleaned for the purpose for NYC Taxi ETA calculation. Content We have features such as Date,…
0 runs0 likes0 downloads0 reach0 impact
5141 instances - 7 features - classes - 4 missing values
Context Using TA-Lib : Technical Analysis Library. Backtest on the SPY Index data, using support and resistance indicators or any other indicator. Content Data contains daily SPY Index: Date Open High…
0 runs0 likes0 downloads0 reach0 impact
6126 instances - 7 features - classes - 0 missing values
For a more comprehensive dataset with many more features check out the "Yacht/Motorboat Pricing Data (10,000+ listings)" dataset. Link below:…
0 runs0 likes0 downloads0 reach0 impact
2530 instances - 18 features - classes - 11212 missing values
Uber vs Lyft This is a very beginner-friendly dataset. It does contain a lot of NA values. It is a good dataset if you want to use a Linear Regression Model to see the pattern between different…
0 runs0 likes0 downloads0 reach0 impact
693071 instances - 56 features - classes - 55095 missing values
Context Census of India is a rich database which can tell stories of over a billion Indians. It is important not only for research point of view, but commercially as well for the organizations that…
0 runs0 likes0 downloads0 reach0 impact
590 instances - 82 features - classes - 3219 missing values
Context At Shadow Robot, we are leaders in robotic grasping and manipulation. As part of our Smart Grasping System development, we're developing different algorithms using machine learning. This first…
0 runs0 likes0 downloads0 reach0 impact
992641 instances - 29 features - classes - 0 missing values
Context The goal of this dataset is to provide a tidy way to access to the transcripts of speeches given by various US politicians in the context of the 2020 US Presidential Election. Transcripts have…
0 runs0 likes0 downloads0 reach0 impact
269 instances - 6 features - classes - 42 missing values
Context This dataset contains information on all 802 Pokemon from all Seven Generations of Pokemon. The information contained in this dataset include Base Stats, Performance against Other Types,…
0 runs0 likes0 downloads0 reach0 impact
801 instances - 41 features - classes - 522 missing values
1. Title: Contraceptive Method Choice 2. Sources: (a) Origin: This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey (b) Creator: Tjen-Sien Lim (limtstat.wisc.edu) (c)…
0 runs0 likes0 downloads0 reach0 impact
1472 instances - 10 features - classes - 0 missing values
Context The ecological footprint measures the ecological assets that a given population requires to produce the natural resources it consumes (including plant-based food and fiber products, livestock…
0 runs0 likes0 downloads0 reach0 impact
188 instances - 21 features - classes - 181 missing values
Context Pantheon is a project celebrating the cultural information that endows our species with these fantastic capacities. To celebrate our global cultural heritage we are compiling, analyzing and…
0 runs0 likes0 downloads0 reach0 impact
11341 instances - 17 features - classes - 11329 missing values
Overview Identification COUNTRY Rwanda TITLE Integrated Household Living Conditions Survey 2010-2011 TRANSLATED TITLE Enqute Intgrale sur les conditions de vie des mnages 2010-2011 STUDY TYPE…
0 runs0 likes0 downloads0 reach0 impact
78432 instances - 23 features - classes - 907612 missing values
Context This is the dataset used in the second chapter of Aurlien Gron's recent book 'Hands-On Machine learning with Scikit-Learn and TensorFlow'. It serves as an excellent introduction to…
0 runs0 likes0 downloads0 reach0 impact
20640 instances - 10 features - classes - 207 missing values
Context The two datasets are related to red and white variants of the Portuguese "Vinho Verde" wine. For more details, consult the reference [Cortez et al., 2009]. Due to privacy and logistic issues,…
0 runs0 likes0 downloads0 reach0 impact
1599 instances - 12 features - classes - 0 missing values
Uncover the factors that lead to employee attrition and explore important questions such as show me a breakdown of distance from home by job role and attrition or compare average monthly income by…
0 runs0 likes0 downloads0 reach0 impact
1470 instances - 35 features - classes - 0 missing values
About our Dataset The journey of the collection of this Covid-19 India dataset begin with a competition where we have to do sentiment analysis of tweets. The data was collected from…
0 runs0 likes0 downloads0 reach0 impact
648958 instances - 4 features - classes - 10980 missing values
Context Every year a lot of people migrate to different countries from Pakistan and a lot of them migrate to Pakistan as emigrants of refugees, Pakistan ranks 2nd, according to UNHCR, among the…
0 runs0 likes0 downloads0 reach0 impact
264 instances - 14 features - classes - 0 missing values
Data Set Information: A dataset of manually-curated BCF for 779 chemicals was used to determine the mechanisms of bioconcentration, i.e. to predict whether a chemical: (1) is mainly stored within…
0 runs0 likes0 downloads0 reach0 impact
779 instances - 13 features - classes - 0 missing values
Context Indian Premier League (IPL) is a Twenty20 cricket format league in India. It is usually played in April and May every year. As of 2019, the title sponsor of the game is Vivo. The league was…
0 runs0 likes0 downloads0 reach0 impact
756 instances - 18 features - classes - 656 missing values
Context The projects goal is to use this dataset to predict the level of success game developers should expect given their game design details. Features such as 'Indie' (developed by indie studio),…
0 runs0 likes0 downloads0 reach0 impact
30250 instances - 27 features - classes - 188331 missing values
Context Electronic dance music (EDM) is a genre where thousands of new songs are released every week. The list of EDM subgenres considered is long, but it also evolves according to trends and musical…
0 runs0 likes0 downloads0 reach0 impact
2900 instances - 94 features - classes - 0 missing values
Do two sentences come from the same article? We randomly sampled sentences from across Wikipedia. Some sentences came from the same articles, others do not. Sentences from the Same Article These two…
0 runs0 likes0 downloads0 reach0 impact
129156 instances - 3 features - classes - 0 missing values
Context After the release of Pokmon Sword and Shield on the November 15, 2019, 81 new Pokemons were released alongside 13 regional variants of preexisting Pokmon. Content Like dataset from previous…
0 runs0 likes0 downloads0 reach0 impact
1017 instances - 13 features - classes - 479 missing values
This dataset stems from Alberto Barradas' popular Pokemon with stats dataset by listing the tiered Pokemon in Smogon 6v6. Smogon 6v6 is one of the most popular formats for competitive Pokemon.…
0 runs0 likes0 downloads0 reach0 impact
499 instances - 15 features - classes - 210 missing values
Context World fact sheet, fun to link with other datasets. Content Information on population, region, area size, infant mortality and more. Acknowledgements Source: All these data sets are made up of…
0 runs0 likes0 downloads0 reach0 impact
227 instances - 20 features - classes - 110 missing values
Acknowledgements The data was scraped from Booking.com. All data in the file is publicly available to everyone already. Data is originally owned by Booking.com. Please contact me through my profile if…
0 runs0 likes0 downloads0 reach0 impact
515738 instances - 17 features - classes - 6536 missing values
Context This is a dataset for a larger project I have been working on. My idea is to analyze and compare real historical weather with weather folklore. Content The CSV file includes a hourly/daily…
0 runs0 likes0 downloads0 reach0 impact
96453 instances - 12 features - classes - 517 missing values
Context Some camera enthusiast went and described 1,000 cameras based on 13 properties! Content Row one describes the datatype for each column and can probably be removed. The 13 properties of each…
0 runs0 likes0 downloads0 reach0 impact
1038 instances - 13 features - 0 classes - 7 missing values
Context The Federal Reserve sets interest rates to promote conditions that achieve the mandate set by the Congress high employment, low and stable inflation, sustainable economic growth, and moderate…
0 runs0 likes0 downloads0 reach0 impact
904 instances - 10 features - classes - 3196 missing values
Context Data Set with the football matches of the Spanish league of the 1st and 2nd division from the 1970-71 to 2016-17 season, has been created with the aim of opening a line of research in the…
0 runs0 likes0 downloads0 reach0 impact
37147 instances - 10 features - classes - 0 missing values