Data
OpenML
Help
Sign in
×
Sign in
No account? Join OpenML
Forgot password
×
JavaScript is required to properly view the contents of this page!
OpenML
Explore
Data
Task
Flow
Run
Study
Task type
Measure
People
Help
Blog
Contact
Please cite us
HMEQ_Data_Preprocessed
ARFF
CSV
JSON
XML
RDF
HMEQ_Data_Preprocessed
active
ARFF
Publicly available
Visibility: public
Uploaded 06-12-2024 by
B Gun
0 likes
downloaded by 0 people , 0 total downloads
0 issues
0 downvotes
Add tag
Issue
#Downvotes for this reason
By
Loading wiki
Help us complete this description
Edit
Predict clients who default on their loan. Dataset is uploaded from kaggle, see citation for the link.
21 features
bad
(target)
nominal
2 unique values
0 missing
loan
numeric
540 unique values
0 missing
mortdue
numeric
5053 unique values
518 missing
value
numeric
5381 unique values
112 missing
yoj
numeric
99 unique values
515 missing
derog
numeric
11 unique values
708 missing
delinq
numeric
14 unique values
580 missing
clage
numeric
5314 unique values
308 missing
ninq
numeric
16 unique values
510 missing
clno
numeric
62 unique values
222 missing
debtinc
numeric
4693 unique values
1267 missing
reason_debtcon
numeric
2 unique values
0 missing
reason_homeimp
numeric
2 unique values
0 missing
reason_missing
numeric
2 unique values
0 missing
job_mgr
numeric
2 unique values
0 missing
job_missing
numeric
2 unique values
0 missing
job_office
numeric
2 unique values
0 missing
job_other
numeric
2 unique values
0 missing
job_profexe
numeric
2 unique values
0 missing
job_sales
numeric
2 unique values
0 missing
job_self
numeric
2 unique values
0 missing
Show all 21 features
19 properties
NumberOfInstances
5960
Number of instances (rows) of the dataset.
NumberOfFeatures
21
Number of attributes (columns) of the dataset.
NumberOfClasses
2
Number of distinct values of the target attribute (if it is nominal).
NumberOfMissingValues
4740
Number of missing values in the dataset.
NumberOfInstancesWithMissingValues
2445
Number of instances with at least one value missing.
NumberOfNumericFeatures
20
Number of numeric attributes.
NumberOfSymbolicFeatures
1
Number of nominal attributes.
PercentageOfBinaryFeatures
4.76
Percentage of binary attributes.
PercentageOfInstancesWithMissingValues
41.02
Percentage of instances having missing values.
PercentageOfMissingValues
3.79
Percentage of missing values.
AutoCorrelation
0.78
Average class difference between consecutive instances.
PercentageOfNumericFeatures
95.24
Percentage of numeric attributes.
Dimensionality
0
Number of attributes divided by the number of instances.
PercentageOfSymbolicFeatures
4.76
Percentage of nominal attributes.
MajorityClassPercentage
80.05
Percentage of instances belonging to the most frequent class.
MajorityClassSize
4771
Number of instances belonging to the most frequent class.
MinorityClassPercentage
19.95
Percentage of instances belonging to the least frequent class.
MinorityClassSize
1189
Number of instances belonging to the least frequent class.
NumberOfBinaryFeatures
1
Number of binary attributes.
Show all 19 properties
1 tasks
Supervised Classification on HMEQ_Data_Preprocessed
0 runs
- estimation_procedure: 10-fold Crossvalidation - target_feature: bad
Define a new task