Context
The original dataset contains 1000 entries with 20 categorial/symbolic attributes prepared by Prof. Hofmann. In this dataset, each entry represents a person who takes a credit by a bank. Each person is classified as good or bad credit risks according to the set of attributes. The link to the original dataset can be found below.
Content
It is almost impossible to understand the original dataset due to its complicated system of categories and symbols. Thus, I wrote a small Python script to convert it into a readable CSV file. The column names were also given in German originally. So, they are replaced by English names while processing. The attributes and their details in English are given below:
Status - Categorical (Ordinal)
Duration - Numerical
Credit History - Categorical (Nominal)
Purpose - Categorical (Nominal)
Amount - Numerical
Savings - Categorical (Ordinal)
Employment Duration - Categorical (Ordinal)
Installment Rate - Categorical (Ordinal)
Personal Status Sex - Categorical (Nominal)
Other Debtors - Categorical (Nominal)
Present Residence - Categorical (Ordinal)
Property - Categorical (Nominal)
Age - Numerical
Other Installment Plans - Categorical (Nominal)
Housing - Categorical (Nominal)
Number Credits - Categorical (Ordinal)
Job - Categorical (Nominal)
People Liable - Categorical (Ordinal)
Telephone - Categorical (Nominal)
Foreign Worker - Categorical (Nominal)
Credit Risk - Binary Target Variable
Acknowledgements
Source : UCI