Data
Default-of-Credit-Card-Clients-Dataset

Default-of-Credit-Card-Clients-Dataset

active ARFF CC0: Public Domain Visibility: public Uploaded 23-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Dataset Information This dataset contains information on default payments, demographic factors, credit data, history of payment, and bill statements of credit card clients in Taiwan from April 2005 to September 2005. Content There are 25 variables: ID: ID of each client LIMIT_BAL: Amount of given credit in NT dollars (includes individual and family/supplementary credit SEX: Gender (1=male, 2=female) EDUCATION: (1=graduate school, 2=university, 3=high school, 4=others, 5=unknown, 6=unknown) MARRIAGE: Marital status (1=married, 2=single, 3=others) AGE: Age in years PAY_0: Repayment status in September, 2005 (-1=pay duly, 1=payment delay for one month, 2=payment delay for two months, 8=payment delay for eight months, 9=payment delay for nine months and above) PAY_2: Repayment status in August, 2005 (scale same as above) PAY_3: Repayment status in July, 2005 (scale same as above) PAY_4: Repayment status in June, 2005 (scale same as above) PAY_5: Repayment status in May, 2005 (scale same as above) PAY_6: Repayment status in April, 2005 (scale same as above) BILL_AMT1: Amount of bill statement in September, 2005 (NT dollar) BILL_AMT2: Amount of bill statement in August, 2005 (NT dollar) BILL_AMT3: Amount of bill statement in July, 2005 (NT dollar) BILL_AMT4: Amount of bill statement in June, 2005 (NT dollar) BILL_AMT5: Amount of bill statement in May, 2005 (NT dollar) BILL_AMT6: Amount of bill statement in April, 2005 (NT dollar) PAY_AMT1: Amount of previous payment in September, 2005 (NT dollar) PAY_AMT2: Amount of previous payment in August, 2005 (NT dollar) PAY_AMT3: Amount of previous payment in July, 2005 (NT dollar) PAY_AMT4: Amount of previous payment in June, 2005 (NT dollar) PAY_AMT5: Amount of previous payment in May, 2005 (NT dollar) PAY_AMT6: Amount of previous payment in April, 2005 (NT dollar) default.payment.next.month: Default payment (1=yes, 0=no) Inspiration Some ideas for exploration: How does the probability of default payment vary by categories of different demographic variables? Which variables are the strongest predictors of default payment? Acknowledgements Any publications based on this dataset should acknowledge the following: Lichman, M. (2013). UCI Machine Learning Repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science. The original dataset can be found here at the UCI Machine Learning Repository.

24 features

ID (ignore)numeric30000 unique values
0 missing
LIMIT_BALnumeric81 unique values
0 missing
SEXnumeric2 unique values
0 missing
EDUCATIONnumeric7 unique values
0 missing
MARRIAGEnumeric4 unique values
0 missing
AGEnumeric56 unique values
0 missing
PAY_0numeric11 unique values
0 missing
PAY_2numeric11 unique values
0 missing
PAY_3numeric11 unique values
0 missing
PAY_4numeric11 unique values
0 missing
PAY_5numeric10 unique values
0 missing
PAY_6numeric10 unique values
0 missing
BILL_AMT1numeric22723 unique values
0 missing
BILL_AMT2numeric22346 unique values
0 missing
BILL_AMT3numeric22026 unique values
0 missing
BILL_AMT4numeric21548 unique values
0 missing
BILL_AMT5numeric21010 unique values
0 missing
BILL_AMT6numeric20604 unique values
0 missing
PAY_AMT1numeric7943 unique values
0 missing
PAY_AMT2numeric7899 unique values
0 missing
PAY_AMT3numeric7518 unique values
0 missing
PAY_AMT4numeric6937 unique values
0 missing
PAY_AMT5numeric6897 unique values
0 missing
PAY_AMT6numeric6939 unique values
0 missing
default.payment.next.monthnumeric2 unique values
0 missing

19 properties

30000
Number of instances (rows) of the dataset.
24
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
24
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
100
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
Average class difference between consecutive instances.
0
Percentage of missing values.

0 tasks

Define a new task