Data
Loan_Approval_Status_Classification

Loan_Approval_Status_Classification

active ARFF CC BY 4.0 Visibility: public Uploaded 12-12-2024 by Yayun Li
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
# Loan Approval Classification Dataset This dataset is a synthetic version inspired by the original Credit Risk dataset on Kaggle and enriched with additional variables based on historical loan approval data. SMOTENC was used to simulate new data points to enlarge the databank. The dataset is structured for both categorical and continuous features. ## Target Variable The target variable 'loan_status' is binary: - 1 = approved - 0 = rejected

24 features

loan_status (target)nominal2 unique values
0 missing
person_agenumeric60 unique values
0 missing
person_educationnumeric5 unique values
0 missing
person_incomenumeric33989 unique values
0 missing
person_emp_expnumeric63 unique values
0 missing
loan_amntnumeric4483 unique values
0 missing
loan_int_ratenumeric1302 unique values
0 missing
loan_percent_incomenumeric64 unique values
0 missing
cb_person_cred_hist_lengthnumeric29 unique values
0 missing
credit_scorenumeric340 unique values
0 missing
defaults_nonumeric2 unique values
0 missing
defaults_yesnumeric2 unique values
0 missing
gender_femalenumeric2 unique values
0 missing
gender_malenumeric2 unique values
0 missing
home_mortgagenumeric2 unique values
0 missing
home_othernumeric2 unique values
0 missing
home_ownnumeric2 unique values
0 missing
home_rentnumeric2 unique values
0 missing
intent_debtconsolidationnumeric2 unique values
0 missing
intent_educationnumeric2 unique values
0 missing
intent_homeimprovementnumeric2 unique values
0 missing
intent_medicalnumeric2 unique values
0 missing
intent_personalnumeric2 unique values
0 missing
intent_venturenumeric2 unique values
0 missing

19 properties

45000
Number of instances (rows) of the dataset.
24
Number of attributes (columns) of the dataset.
2
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
23
Number of numeric attributes.
1
Number of nominal attributes.
1
Number of binary attributes.
4.17
Percentage of binary attributes.
0
Percentage of instances having missing values.
0.78
Average class difference between consecutive instances.
0
Percentage of missing values.
0
Number of attributes divided by the number of instances.
95.83
Percentage of numeric attributes.
77.78
Percentage of instances belonging to the most frequent class.
4.17
Percentage of nominal attributes.
35000
Number of instances belonging to the most frequent class.
22.22
Percentage of instances belonging to the least frequent class.
10000
Number of instances belonging to the least frequent class.

1 tasks

0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: loan_status
Define a new task