Data
Filter results by:
This data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics, (b) its assigned insurance risk rating, (c) its normalized losses in use as…
0 runs0 likes0 downloads0 reach0 impact
1000000 instances - 26 features - 0 classes - 0 missing values
Thyroid disease records supplied by the Garavan Institute and J. Ross Quinlan, New South Wales Institute, Syndney, Australia. 1987.
0 runs0 likes0 downloads0 reach0 impact
3103 instances - 23 features - 2 classes - 0 missing values
The dataset consists of feature vectors belonging to 12,330 sessions.The dataset was formed so that each sessionwould belong to a different user in a 1-year period to avoidany tendency to a specific…
0 runs0 likes0 downloads0 reach0 impact
12330 instances - 18 features - 2 classes - 0 missing values
This dataset contain attributes of dresses and their recommendations according to their sales. Sales are monitor on the basis of alternate days.The attributes present analyzed are: Recommendation,…
0 runs0 likes0 downloads0 reach0 impact
500 instances - 13 features - 0 classes - 0 missing values
This data set contains details of a bank's customers and the target variable is a binary variable reflecting the fact whether the customer left the bank (closed his account) or he continues to be a…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 11 features - 0 classes - 0 missing values
The dataset represents 10 years (1999-2008) of clinical care at 130 US hospitals and integrated delivery networks. It includes over 50 features representing patient and hospital outcomes. Information…
0 runs0 likes0 downloads0 reach0 impact
101766 instances - 48 features - 3 classes - 192849 missing values
One of the biggest challenges of an auto dealership purchasing a used car at an auto auction is the risk of that the vehicle might have serious issues that prevent it from being sold to customers. The…
0 runs0 likes0 downloads0 reach0 impact
67212 instances - 31 features - 0 classes - 0 missing values
Many people struggle to get loans due to insufficient or non-existent credit histories. And, unfortunately, this population is often taken advantage of by untrustworthy lenders.Home Credit strives to…
0 runs0 likes0 downloads0 reach0 impact
244280 instances - 70 features - 0 classes - 0 missing values
This dataset is for classification tasks, and has both continuous and categorical variables.
0 runs0 likes0 downloads0 reach0 impact
5032 instances - 46 features - 0 classes - 0 missing values
A dataset relating characteristics of telephony account features and usage and whether or not the customer churned. Originally used in Discovering Knowledge in Data: An Introduction to Data…
0 runs0 likes0 downloads0 reach0 impact
5000 instances - 21 features - 0 classes - 0 missing values
Nomao collects data about places (name, phone, localization...) from many sources. Deduplication consists in detecting what data refer to the same place. Instances in the dataset compare 2 spots.The…
0 runs0 likes0 downloads0 reach0 impact
34465 instances - 119 features - 0 classes - 0 missing values
Data reported to the police about the circumstances of personal injury road accidents in Great Britain from 1979, and the maker and model information of vehicles involved in the respective…
0 runs0 likes0 downloads0 reach0 impact
111762 instances - 33 features - 0 classes - 0 missing values
This dataset is for classification tasks, and has both continuous and categorical variables.
0 runs0 likes0 downloads0 reach0 impact
100959 instances - 30 features - 0 classes - 0 missing values
User profile data for San Francisco OkCupid users published in [Kim, A. Y., & Escobedo-Land, A. (2015). OKCupid data for introductory statistics and data science courses. Journal of Statistics…
0 runs0 likes0 downloads0 reach0 impact
26677 instances - 14 features - 3 classes - 0 missing values
Prediction task is to determine whether a person makes over 50K a year. Extraction was done by Barry Becker from the 1994 Census database. A set of reasonably clean records was extracted using the…
0 runs0 likes0 downloads0 reach0 impact
48842 instances - 15 features - 2 classes - 0 missing values
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect the confidentiality of the data.This dataset is interesting because…
0 runs0 likes0 downloads0 reach0 impact
653 instances - 16 features - 2 classes - 0 missing values
The data is related with direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more than one contact to the same client was…
0 runs0 likes0 downloads0 reach0 impact
45211 instances - 17 features - 0 classes - 0 missing values
Jarkko Salojarvi, Kai Puolamaki, Jaana Simola, Lauri Kovanen, Ilpo Kojo, Samuel Kaski. Inferring Relevance from Eye Movements: Feature Extraction. Helsinki University of Technology, Publications in…
0 runs0 likes0 downloads0 reach0 impact
7608 instances - 24 features - 0 classes - 0 missing values
An international e-commerce company based wants to discover key insights from their customer database. They want to use some of the most advanced machine learning techniques to study their customers.…
0 runs0 likes0 downloads0 reach0 impact
10999 instances - 10 features - 0 classes - 0 missing values
This is a "supervised learning" challenge in machine learning. We are making available 30 datasets, all pre-formatted in given feature representations (this means that each example consists of a fixed…
0 runs0 likes0 downloads0 reach0 impact
2984 instances - 145 features - 0 classes - 0 missing values
This dataset classifies people described by a set of attributes as good or bad credit risks.This dataset comes with a cost matrix:Good Bad (predicted) Good 0 1 (actual)Bad 5 0 It is worse to class a…
0 runs0 likes0 downloads0 reach0 impact
1000 instances - 21 features - 2 classes - 0 missing values
test
0 runs0 likes0 downloads0 reach0 impact
48842 instances - 15 features - 2 classes - 6465 missing values
This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey. The samples are married women who were either not pregnant or do not know if they were at the time of…
0 runs0 likes0 downloads0 reach0 impact
1473 instances - 10 features - 0 classes - 0 missing values
This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey. The samples are married women who were either not pregnant or do not know if they were at the time of…
0 runs0 likes0 downloads0 reach0 impact
1473 instances - 10 features - 0 classes - 0 missing values
This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey. The samples are married women who were either not pregnant or do not know if they were at the time of…
0 runs0 likes0 downloads0 reach0 impact
1473 instances - 10 features - 0 classes - 0 missing values
This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey. The samples are married women who were either not pregnant or do not know if they were at the time of…
0 runs0 likes0 downloads0 reach0 impact
1473 instances - 10 features - 0 classes - 0 missing values
This dataset is a subset of the 1987 National Indonesia Contraceptive Prevalence Survey. The samples are married women who were either not pregnant or do not know if they were at the time of…
0 runs0 likes0 downloads0 reach0 impact
1473 instances - 10 features - 0 classes - 0 missing values
The QSAR biodegradation dataset was built in the Milano Chemometrics and QSAR Research Group. The research leading to these results has received funding from the European Communitys Seventh Framework…
0 runs0 likes0 downloads0 reach0 impact
1055 instances - 41 features - 2 classes - 0 missing values
This dataset is for classification tasks, and has both continuous and categorical variables.
0 runs0 likes0 downloads0 reach0 impact
23548 instances - 11 features - 0 classes - 0 missing values
Airlines Dataset Inspired in the regression dataset from Elena Ikonomovska. The task is to predict whether a given flight will be delayed, given the information of the scheduled departure.
0 runs0 likes0 downloads0 reach0 impact
539383 instances - 8 features - 0 classes - 0 missing values