credit_risk_china

Status: active. Format: ARFF. License: CC BY 4.0. Visibility: public. Uploaded 25-11-2024 by Sebastian Silva Ruiz.


Multi-classification assessment of bank (China) personal credit risk based on multi-source information fusion

28 features

five_categories (target): nominal, 5 unique values, 0 missing
customer_id: numeric, 27522 unique values, 0 missing
type_of_loan_business: nominal, 22 unique values, 0 missing
guarantee_the_balance: numeric, 20191 unique values, 304 missing
account_connection_amount: numeric, 24789 unique values, 304 missing
security_guarantee_amount: numeric, 20209 unique values, 304 missing
five-level_classification: nominal, 11 unique values, 0 missing
whether_interest_is_owed: nominal, 2 unique values, 0 missing
whether_self-service_loan: nominal, 2 unique values, 0 missing
type_of_guarantee: nominal, 17 unique values, 299 missing
safety_coefficient: numeric, 8 unique values, 304 missing
collateral_value_(yuan): numeric, 11577 unique values, 322 missing
guarantee_method: nominal, 4 unique values, 304 missing
date_code: nominal, 3 unique values, 0 missing
approval_deadline: numeric, 37 unique values, 0 missing
whether_devalue_account: nominal, 2 unique values, 0 missing
industry_category: nominal, 19 unique values, 25444 missing
down_payment_amount: numeric, 10036 unique values, 620 missing
whether_personal_business_loan: nominal, 2 unique values, 0 missing
whether_interest_is_owed_(regulatory_standard): nominal, 2 unique values, 0 missing
repayment_type: nominal, 2 unique values, 745 missing
installment_repayment_method_(numerical_type): numeric, 2 unique values, 757 missing
installment_repayment_method_(discrete_type): nominal, 2 unique values, 757 missing
installment_repayment_cycle_(numerical_type): nominal, 2 unique values, 255 missing
repayment_cycle_(discrete_type): nominal, 2 unique values, 255 missing
number_of_houses: numeric, 4 unique values, 4571 missing
month_property_costs: numeric, 3344 unique values, 1286 missing
family_monthly_income: numeric, 2940 unique values, 20 missing
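
The missing-value counts listed above are easier to compare as rates. The following is a minimal sketch, not part of the dataset page: it copies the per-feature counts and the row count (27522) from this listing and flags features whose missing rate exceeds a cutoff. The helper names and the 50% cutoff are assumptions chosen for illustration.

```python
# Row count and per-feature missing counts copied from the listing above.
# Features with 0 missing values are omitted.
N_ROWS = 27522

MISSING = {
    "guarantee_the_balance": 304,
    "account_connection_amount": 304,
    "security_guarantee_amount": 304,
    "type_of_guarantee": 299,
    "safety_coefficient": 304,
    "collateral_value_(yuan)": 322,
    "guarantee_method": 304,
    "industry_category": 25444,
    "down_payment_amount": 620,
    "repayment_type": 745,
    "installment_repayment_method_(numerical_type)": 757,
    "installment_repayment_method_(discrete_type)": 757,
    "installment_repayment_cycle_(numerical_type)": 255,
    "repayment_cycle_(discrete_type)": 255,
    "number_of_houses": 4571,
    "month_property_costs": 1286,
    "family_monthly_income": 20,
}


def missing_rates(missing, n_rows):
    """Return {feature: percent missing}, ordered from most to least missing."""
    rates = {name: 100.0 * count / n_rows for name, count in missing.items()}
    return dict(sorted(rates.items(), key=lambda kv: kv[1], reverse=True))


def features_above(rates, threshold_pct):
    """Return the features whose missing rate exceeds threshold_pct."""
    return [name for name, pct in rates.items() if pct > threshold_pct]


rates = missing_rates(MISSING, N_ROWS)
sparse = features_above(rates, 50.0)  # 50% is an arbitrary illustrative cutoff

for name, pct in rates.items():
    print(f"{name}: {pct:.2f}% missing")
print("Mostly missing:", sparse)
```

On these counts, industry_category is the only feature that is missing in more than half of the rows, so it likely needs dropping or special handling before modelling.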

19 properties

Number of instances (rows) of the dataset: 27522
Number of attributes (columns) of the dataset: 28
Number of distinct values of the target attribute (if it is nominal): 5
Number of missing values in the dataset: 36851
Number of instances with at least one value missing: 26342
Number of numeric attributes: 12
Number of nominal attributes: 16
Percentage of binary attributes: 32.14
Percentage of instances having missing values: 95.71
Average class difference between consecutive instances: 0.87
Percentage of missing values: 4.78
Number of attributes divided by the number of instances: 0
Percentage of numeric attributes: 42.86
Percentage of instances belonging to the most frequent class: 92.71
Percentage of nominal attributes: 57.14
Number of instances belonging to the most frequent class: 25516
Percentage of instances belonging to the least frequent class: 0.37
Number of instances belonging to the least frequent class: 103
Number of binary attributes: 9
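
Most of the percentages above follow directly from the raw counts. The sketch below recomputes them as a consistency check. It uses only numbers stated in this listing; the variable names, the dictionary keys, and the rounding to two decimals are assumptions made for illustration.

```python
# Raw counts copied from the properties above.
N_ROWS = 27522
N_COLS = 28
N_MISSING_VALUES = 36851
N_ROWS_WITH_MISSING = 26342
N_NUMERIC = 12
N_NOMINAL = 16
N_BINARY = 9
N_MAJORITY = 25516  # instances in the most frequent class
N_MINORITY = 103    # instances in the least frequent class


def pct(part, whole):
    """Percentage of `part` in `whole`, rounded to two decimals."""
    return round(100.0 * part / whole, 2)


derived = {
    "pct_missing_values": pct(N_MISSING_VALUES, N_ROWS * N_COLS),
    "pct_rows_with_missing": pct(N_ROWS_WITH_MISSING, N_ROWS),
    "pct_numeric": pct(N_NUMERIC, N_COLS),
    "pct_nominal": pct(N_NOMINAL, N_COLS),
    "pct_binary": pct(N_BINARY, N_COLS),
    "pct_majority_class": pct(N_MAJORITY, N_ROWS),
    "pct_minority_class": pct(N_MINORITY, N_ROWS),
    # Attributes per instance: about 0.001, shown as 0 at two decimals.
    "dimensionality": round(N_COLS / N_ROWS, 2),
}

for name, value in derived.items():
    print(f"{name}: {value}")
```

The majority-class share is worth noting: a classifier that always predicts the most frequent class would already reach about 92.71% accuracy on this data, so accuracy alone says little about performance on the rare classes.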

1 task

0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: five_categories
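
With only 103 instances in the rarest class, how the 10 folds are drawn matters. The sketch below shows one way to build stratified folds in plain Python, assuming the folds are stratified by five_categories, which this page does not state. The label list is hypothetical: it uses the two class counts given above (25516 and 103) and lumps the remaining 1903 rows, whose per-class split is not given here, into a single placeholder class.

```python
import random
from collections import defaultdict


def stratified_k_fold(labels, k=10, seed=0):
    """Assign every row index to one of k folds, spreading each class evenly."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    folds = [[] for _ in range(k)]
    for label in sorted(by_class):
        indices = by_class[label]
        rng.shuffle(indices)
        # Deal the shuffled indices of this class round-robin into the folds.
        for pos, idx in enumerate(indices):
            folds[pos % k].append(idx)
    return folds


# Hypothetical labels: only the majority and minority counts come from the page.
labels = ["majority"] * 25516 + ["other"] * 1903 + ["minority"] * 103
folds = stratified_k_fold(labels, k=10)

for i, test_idx in enumerate(folds):
    held_out = set(test_idx)
    train_idx = [j for j in range(len(labels)) if j not in held_out]
    n_minority = sum(1 for j in test_idx if labels[j] == "minority")
    print(f"fold {i}: train={len(train_idx)} test={len(test_idx)} "
          f"minority_in_test={n_minority}")
```

Dealing each class round-robin keeps every test fold at 10 or 11 minority instances, whereas an unstratified split could leave some folds with far fewer.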