Data
OpenML
Help
Sign in
×
Sign in
No account? Join OpenML
Forgot password
×
JavaScript is required to properly view the contents of this page!
OpenML
Explore
Data
Task
Flow
Run
Study
Task type
Measure
People
Help
Blog
Contact
Please cite us
dataset_credit_score
ARFF
CSV
JSON
XML
RDF
dataset_credit_score
active
ARFF
CC BY 4.0
Visibility: public
Uploaded 13-12-2024 by
Sebastian Silva Ruiz
0 likes
downloaded by 0 people , 0 total downloads
0 issues
0 downvotes
Add tag
Issue
#Downvotes for this reason
By
Loading wiki
Help us complete this description
Edit
Financial dataset for automl benchmark. Name = dataset_credit_score, target = credit_score
28 features
credit_score
(target)
nominal
3 unique values
0 missing
id
numeric
100000 unique values
0 missing
customer_id
numeric
12500 unique values
0 missing
month
numeric
8 unique values
0 missing
name
numeric
10140 unique values
0 missing
age
numeric
1788 unique values
0 missing
ssn
numeric
12501 unique values
0 missing
occupation
numeric
16 unique values
0 missing
annual_income
numeric
18940 unique values
0 missing
monthly_inhand_salary
numeric
13235 unique values
15002 missing
num_bank_accounts
numeric
943 unique values
0 missing
num_credit_card
numeric
1179 unique values
0 missing
interest_rate
numeric
1750 unique values
0 missing
num_of_loan
numeric
434 unique values
0 missing
type_of_loan
numeric
6261 unique values
0 missing
delay_from_due_date
numeric
73 unique values
0 missing
num_of_delayed_payment
numeric
750 unique values
0 missing
changed_credit_limit
numeric
4324 unique values
0 missing
num_credit_inquiries
numeric
1223 unique values
1965 missing
credit_mix
numeric
4 unique values
0 missing
outstanding_debt
numeric
13178 unique values
0 missing
credit_utilization_ratio
numeric
100000 unique values
0 missing
credit_history_age
numeric
405 unique values
0 missing
payment_of_min_amount
numeric
3 unique values
0 missing
total_emi_per_month
numeric
14950 unique values
0 missing
amount_invested_monthly
numeric
91050 unique values
0 missing
payment_behaviour
numeric
7 unique values
0 missing
monthly_balance
numeric
98793 unique values
0 missing
Show all 28 features
19 properties
NumberOfInstances
100000
Number of instances (rows) of the dataset.
NumberOfFeatures
28
Number of attributes (columns) of the dataset.
NumberOfClasses
3
Number of distinct values of the target attribute (if it is nominal).
NumberOfMissingValues
16967
Number of missing values in the dataset.
NumberOfInstancesWithMissingValues
16684
Number of instances with at least one value missing.
NumberOfNumericFeatures
27
Number of numeric attributes.
NumberOfSymbolicFeatures
1
Number of nominal attributes.
PercentageOfBinaryFeatures
0
Percentage of binary attributes.
PercentageOfInstancesWithMissingValues
16.68
Percentage of instances having missing values.
AutoCorrelation
0.77
Average class difference between consecutive instances.
PercentageOfMissingValues
0.61
Percentage of missing values.
Dimensionality
0
Number of attributes divided by the number of instances.
PercentageOfNumericFeatures
96.43
Percentage of numeric attributes.
MajorityClassPercentage
53.17
Percentage of instances belonging to the most frequent class.
PercentageOfSymbolicFeatures
3.57
Percentage of nominal attributes.
MajorityClassSize
53174
Number of instances belonging to the most frequent class.
MinorityClassPercentage
17.83
Percentage of instances belonging to the least frequent class.
MinorityClassSize
17828
Number of instances belonging to the least frequent class.
NumberOfBinaryFeatures
0
Number of binary attributes.
Show all 19 properties
1 tasks
Supervised Classification on dataset_credit_score
0 runs
- estimation_procedure: 10-fold Crossvalidation - target_feature: credit_score
Define a new task