OpenML
German-Credit-Data

German-Credit-Data

active ARFF Database: Open Database, Contents: Database Contents Visibility: public Uploaded 24-03-2022 by Elif Ceren Gok
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Physical Sciences Transportation
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context The original dataset contains 1000 entries with 20 categorial/symbolic attributes prepared by Prof. Hofmann. In this dataset, each entry represents a person who takes a credit by a bank. Each person is classified as good or bad credit risks according to the set of attributes. The link to the original dataset can be found below. Content It is almost impossible to understand the original dataset due to its complicated system of categories and symbols. Thus, I wrote a small Python script to convert it into a readable CSV file. The column names were also given in German originally. So, they are replaced by English names while processing. The attributes and their details in English are given below: Status - Categorical (Ordinal) Duration - Numerical Credit History - Categorical (Nominal) Purpose - Categorical (Nominal) Amount - Numerical Savings - Categorical (Ordinal) Employment Duration - Categorical (Ordinal) Installment Rate - Categorical (Ordinal) Personal Status Sex - Categorical (Nominal) Other Debtors - Categorical (Nominal) Present Residence - Categorical (Ordinal) Property - Categorical (Nominal) Age - Numerical Other Installment Plans - Categorical (Nominal) Housing - Categorical (Nominal) Number Credits - Categorical (Ordinal) Job - Categorical (Nominal) People Liable - Categorical (Ordinal) Telephone - Categorical (Nominal) Foreign Worker - Categorical (Nominal) Credit Risk - Binary Target Variable Acknowledgements Source : UCI

21 features

laufkontnumeric4 unique values
0 missing
laufzeitnumeric33 unique values
0 missing
moralnumeric5 unique values
0 missing
verwnumeric10 unique values
0 missing
hoehenumeric923 unique values
0 missing
sparkontnumeric5 unique values
0 missing
beszeitnumeric5 unique values
0 missing
ratenumeric4 unique values
0 missing
famgesnumeric4 unique values
0 missing
buergenumeric3 unique values
0 missing
wohnzeitnumeric4 unique values
0 missing
vermnumeric4 unique values
0 missing
alternumeric53 unique values
0 missing
weitkrednumeric3 unique values
0 missing
wohnnumeric3 unique values
0 missing
bishkrednumeric4 unique values
0 missing
berufnumeric4 unique values
0 missing
persnumeric2 unique values
0 missing
telefnumeric2 unique values
0 missing
gastarbnumeric2 unique values
0 missing
kreditnumeric2 unique values
0 missing

19 properties

1000
Number of instances (rows) of the dataset.
21
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
21
Number of numeric attributes.
0
Number of nominal attributes.
0.02
Number of attributes divided by the number of instances.
100
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
Average class difference between consecutive instances.
0
Percentage of missing values.

0 tasks

Define a new task