Data
dataset_credit_score

dataset_credit_score

active ARFF CC BY 4.0 Visibility: public Uploaded 13-12-2024 by Sebastian Silva Ruiz
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Financial dataset for automl benchmark. Name = dataset_credit_score, target = credit_score

28 features

credit_score (target)nominal3 unique values
0 missing
idnumeric100000 unique values
0 missing
customer_idnumeric12500 unique values
0 missing
monthnumeric8 unique values
0 missing
namenumeric10140 unique values
0 missing
agenumeric1788 unique values
0 missing
ssnnumeric12501 unique values
0 missing
occupationnumeric16 unique values
0 missing
annual_incomenumeric18940 unique values
0 missing
monthly_inhand_salarynumeric13235 unique values
15002 missing
num_bank_accountsnumeric943 unique values
0 missing
num_credit_cardnumeric1179 unique values
0 missing
interest_ratenumeric1750 unique values
0 missing
num_of_loannumeric434 unique values
0 missing
type_of_loannumeric6261 unique values
0 missing
delay_from_due_datenumeric73 unique values
0 missing
num_of_delayed_paymentnumeric750 unique values
0 missing
changed_credit_limitnumeric4324 unique values
0 missing
num_credit_inquiriesnumeric1223 unique values
1965 missing
credit_mixnumeric4 unique values
0 missing
outstanding_debtnumeric13178 unique values
0 missing
credit_utilization_rationumeric100000 unique values
0 missing
credit_history_agenumeric405 unique values
0 missing
payment_of_min_amountnumeric3 unique values
0 missing
total_emi_per_monthnumeric14950 unique values
0 missing
amount_invested_monthlynumeric91050 unique values
0 missing
payment_behaviournumeric7 unique values
0 missing
monthly_balancenumeric98793 unique values
0 missing

19 properties

100000
Number of instances (rows) of the dataset.
28
Number of attributes (columns) of the dataset.
3
Number of distinct values of the target attribute (if it is nominal).
16967
Number of missing values in the dataset.
16684
Number of instances with at least one value missing.
27
Number of numeric attributes.
1
Number of nominal attributes.
0
Percentage of binary attributes.
16.68
Percentage of instances having missing values.
0.77
Average class difference between consecutive instances.
0.61
Percentage of missing values.
0
Number of attributes divided by the number of instances.
96.43
Percentage of numeric attributes.
53.17
Percentage of instances belonging to the most frequent class.
3.57
Percentage of nominal attributes.
53174
Number of instances belonging to the most frequent class.
17.83
Percentage of instances belonging to the least frequent class.
17828
Number of instances belonging to the least frequent class.
0
Number of binary attributes.

1 tasks

0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: credit_score
Define a new task