Data
OpenML
Help
Sign in
×
Sign in
No account? Join OpenML
Forgot password
×
JavaScript is required to properly view the contents of this page!
OpenML
Explore
Data
Task
Flow
Run
Study
Task type
Measure
People
Help
Blog
Contact
Please cite us
HMEQ_Data
ARFF
CSV
JSON
XML
RDF
HMEQ_Data
active
ARFF
Publicly available
Visibility: public
Uploaded 22-11-2024 by
Bilge Gun
0 likes
downloaded by 0 people , 0 total downloads
0 issues
0 downvotes
Add tag
Issue
#Downvotes for this reason
By
Loading wiki
Help us complete this description
Edit
Predict clients who default on their loan. Dataset is uploaded from kaggle, see citation for the link.
21 features
bad
(target)
nominal
2 unique values
0 missing
loan
numeric
540 unique values
0 missing
mortdue
numeric
5053 unique values
518 missing
value
numeric
5381 unique values
112 missing
yoj
numeric
99 unique values
515 missing
derog
numeric
11 unique values
708 missing
delinq
numeric
14 unique values
580 missing
clage
numeric
5314 unique values
308 missing
ninq
numeric
16 unique values
510 missing
clno
numeric
62 unique values
222 missing
debtinc
numeric
4693 unique values
1267 missing
job_Mgr
string
2 unique values
0 missing
job_Office
string
2 unique values
0 missing
job_Other
string
2 unique values
0 missing
job_ProfExe
string
2 unique values
0 missing
job_Sales
string
2 unique values
0 missing
job_Self
string
2 unique values
0 missing
job_nan
string
2 unique values
0 missing
reason_DebtCon
string
2 unique values
0 missing
reason_HomeImp
string
2 unique values
0 missing
reason_nan
string
2 unique values
0 missing
Show all 21 features
19 properties
NumberOfInstances
5960
Number of instances (rows) of the dataset.
NumberOfFeatures
21
Number of attributes (columns) of the dataset.
NumberOfClasses
2
Number of distinct values of the target attribute (if it is nominal).
NumberOfMissingValues
4740
Number of missing values in the dataset.
NumberOfInstancesWithMissingValues
2445
Number of instances with at least one value missing.
NumberOfNumericFeatures
10
Number of numeric attributes.
NumberOfSymbolicFeatures
1
Number of nominal attributes.
PercentageOfInstancesWithMissingValues
41.02
Percentage of instances having missing values.
AutoCorrelation
0.78
Average class difference between consecutive instances.
PercentageOfMissingValues
3.79
Percentage of missing values.
Dimensionality
0
Number of attributes divided by the number of instances.
PercentageOfNumericFeatures
47.62
Percentage of numeric attributes.
MajorityClassPercentage
80.05
Percentage of instances belonging to the most frequent class.
PercentageOfSymbolicFeatures
4.76
Percentage of nominal attributes.
MajorityClassSize
4771
Number of instances belonging to the most frequent class.
MinorityClassPercentage
19.95
Percentage of instances belonging to the least frequent class.
MinorityClassSize
1189
Number of instances belonging to the least frequent class.
NumberOfBinaryFeatures
1
Number of binary attributes.
PercentageOfBinaryFeatures
4.76
Percentage of binary attributes.
Show all 19 properties
1 tasks
Supervised Classification on HMEQ_Data
0 runs
- estimation_procedure: 10-fold Crossvalidation - target_feature: bad
Define a new task