Data
OpenML
Help
Sign in
×
Sign in
No account? Join OpenML
Forgot password
×
JavaScript is required to properly view the contents of this page!
OpenML
Explore
Data
Task
Flow
Run
Study
Task type
Measure
People
Help
Blog
Contact
Please cite us
Credit_Score_Classification_downsampled
ARFF
CSV
JSON
XML
RDF
Credit_Score_Classification_downsampled
active
ARFF
CC BY 4.0
Visibility: public
Uploaded 27-11-2024 by
Anna Wiewer
0 likes
downloaded by 0 people , 0 total downloads
0 issues
0 downvotes
Add tag
Issue
#Downvotes for this reason
By
Loading wiki
Help us complete this description
Edit
This dataset contains customer credit score information, which can be used for classification purposes.
28 features
Credit_Score
(target)
string
3 unique values
0 missing
ID
string
50000 unique values
0 missing
Customer_ID
string
12454 unique values
0 missing
Month
string
8 unique values
0 missing
Name
string
10088 unique values
4989 missing
Age
numeric
904 unique values
0 missing
SSN
string
12434 unique values
0 missing
Occupation
string
16 unique values
0 missing
Annual_Income
numeric
12952 unique values
0 missing
Monthly_Inhand_Salary
numeric
12838 unique values
7507 missing
Num_Bank_Accounts
numeric
568 unique values
0 missing
Num_Credit_Card
numeric
821 unique values
0 missing
Interest_Rate
numeric
958 unique values
0 missing
Num_of_Loan
numeric
248 unique values
0 missing
Type_of_Loan
string
6240 unique values
5784 missing
Delay_from_due_date
numeric
68 unique values
0 missing
Num_of_Delayed_Payment
numeric
394 unique values
3528 missing
Changed_Credit_Limit
numeric
3915 unique values
1062 missing
Num_Credit_Inquiries
numeric
689 unique values
993 missing
Credit_Mix
string
4 unique values
0 missing
Outstanding_Debt
numeric
12159 unique values
0 missing
Credit_Utilization_Ratio
numeric
50000 unique values
0 missing
Credit_History_Age
string
404 unique values
4548 missing
Payment_of_Min_Amount
string
3 unique values
0 missing
Total_EMI_per_month
numeric
13184 unique values
0 missing
Amount_invested_monthly
numeric
45451 unique values
2273 missing
Payment_Behaviour
string
7 unique values
0 missing
Monthly_Balance
numeric
49408 unique values
587 missing
Show all 28 features
19 properties
NumberOfInstances
50000
Number of instances (rows) of the dataset.
NumberOfFeatures
28
Number of attributes (columns) of the dataset.
NumberOfClasses
3
Number of distinct values of the target attribute (if it is nominal).
NumberOfMissingValues
31271
Number of missing values in the dataset.
NumberOfInstancesWithMissingValues
24191
Number of instances with at least one value missing.
NumberOfNumericFeatures
16
Number of numeric attributes.
NumberOfSymbolicFeatures
0
Number of nominal attributes.
PercentageOfBinaryFeatures
0
Percentage of binary attributes.
PercentageOfInstancesWithMissingValues
48.38
Percentage of instances having missing values.
PercentageOfMissingValues
2.23
Percentage of missing values.
AutoCorrelation
1
Average class difference between consecutive instances.
PercentageOfNumericFeatures
57.14
Percentage of numeric attributes.
Dimensionality
0
Number of attributes divided by the number of instances.
PercentageOfSymbolicFeatures
0
Percentage of nominal attributes.
MajorityClassPercentage
53.2
Percentage of instances belonging to the most frequent class.
MajorityClassSize
26602
Number of instances belonging to the most frequent class.
MinorityClassPercentage
17.96
Percentage of instances belonging to the least frequent class.
MinorityClassSize
8980
Number of instances belonging to the least frequent class.
NumberOfBinaryFeatures
0
Number of binary attributes.
Show all 19 properties
1 tasks
Supervised Classification on Credit_Score_Classification_downsampled
0 runs
- estimation_procedure: 10-fold Crossvalidation - target_feature: Credit_Score
Define a new task