EDA-Home-Mortgage-NY-Sampled


Status: active. Format: ARFF. Visibility: public. Uploaded 12-12-2024 by B Gun.


This dataset was uploaded from Kaggle. Source: https://www.kaggle.com/code/ambarish/eda-home-mortgage-ny-with-feature-analysis/script

30 features

action_taken (target): nominal, 6 unique values, 0 missing
agency_code: numeric, 6 unique values, 0 missing
applicant_ethnicity: numeric, 4 unique values, 0 missing
applicant_income_000s: numeric, 1186 unique values, 6212 missing
applicant_race_1: numeric, 7 unique values, 0 missing
applicant_sex: numeric, 4 unique values, 0 missing
application_date_indicator: numeric, 2 unique values, 0 missing
census_tract_number: numeric, 2597 unique values, 190 missing
co_applicant_ethnicity: numeric, 5 unique values, 0 missing
co_applicant_race_1: numeric, 8 unique values, 0 missing
co_applicant_sex: numeric, 5 unique values, 0 missing
county_code: numeric, 62 unique values, 131 missing
denial_reason_1: numeric, 9 unique values, 38157 missing
hoepa_status: numeric, 2 unique values, 0 missing
lien_status: numeric, 4 unique values, 0 missing
loan_purpose: numeric, 3 unique values, 0 missing
loan_type: numeric, 4 unique values, 0 missing
msamd: numeric, 14 unique values, 3645 missing
owner_occupancy: numeric, 3 unique values, 0 missing
preapproval: numeric, 3 unique values, 0 missing
property_type: numeric, 3 unique values, 0 missing
purchaser_type: numeric, 9 unique values, 0 missing
sequence_number: numeric, 28134 unique values, 0 missing
hud_median_family_income: numeric, 15 unique values, 190 missing
loan_amount_000s: numeric, 1627 unique values, 0 missing
number_of_1_to_4_family_units: numeric, 2133 unique values, 254 missing
number_of_owner_occupied_units: numeric, 1831 unique values, 235 missing
minority_population: numeric, 3230 unique values, 193 missing
population: numeric, 3242 unique values, 193 missing
tract_to_msamd_income: numeric, 3805 unique values, 205 missing
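
The per-feature missing counts in the listing above can be cross-checked against the row count reported on this page (43965). The following is a minimal sketch that hard-codes the counts copied from the listing; the names `missing_counts`, `N_ROWS`, and `missing_rate` are illustrative and not part of any OpenML API.

```python
# Missing-value counts copied from the feature listing.
# Features with 0 missing values are omitted.
missing_counts = {
    "applicant_income_000s": 6212,
    "census_tract_number": 190,
    "county_code": 131,
    "denial_reason_1": 38157,
    "msamd": 3645,
    "hud_median_family_income": 190,
    "number_of_1_to_4_family_units": 254,
    "number_of_owner_occupied_units": 235,
    "minority_population": 193,
    "population": 193,
    "tract_to_msamd_income": 205,
}

N_ROWS = 43965  # number of instances reported on the page


def missing_rate(count, n_rows=N_ROWS):
    """Return the percentage of rows in which one feature is missing."""
    return 100.0 * count / n_rows


total_missing = sum(missing_counts.values())

# Print features from most to least missing.
for name, count in sorted(missing_counts.items(), key=lambda kv: -kv[1]):
    print(f"{name:32s} {count:6d} {missing_rate(count):6.2f}%")
print("total missing values:", total_missing)
```

The per-feature counts sum to the dataset-wide total of missing values listed on this page. Most of that total comes from `denial_reason_1`, presumably because a denial reason is only recorded for denied applications.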

19 properties

Number of instances (rows) of the dataset: 43965
Number of attributes (columns) of the dataset: 30
Number of distinct values of the target attribute (if it is nominal): 6
Number of missing values in the dataset: 49605
Number of instances with at least one value missing: 39152
Number of numeric attributes: 29
Number of nominal attributes: 1
Percentage of binary attributes: 0
Percentage of instances having missing values: 89.05
Percentage of missing values: 3.76
Average class difference between consecutive instances: 1
Percentage of numeric attributes: 96.67
Number of attributes divided by the number of instances: 0
Percentage of nominal attributes: 3.33
Percentage of instances belonging to the most frequent class: 51.87
Number of instances belonging to the most frequent class: 22805
Percentage of instances belonging to the least frequent class: 3.23
Number of instances belonging to the least frequent class: 1418
Number of binary attributes: 0
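
Several of the percentage properties follow directly from the raw counts on this page. The sketch below recomputes them as a consistency check; the variable names and the `pct` helper are illustrative, and rounding to two decimals is an assumption chosen to match the displayed values.

```python
# Raw counts copied from the properties listed on this page.
n_rows, n_cols = 43965, 30
n_missing_values = 49605
n_rows_with_missing = 39152
n_numeric, n_nominal = 29, 1
majority_class_size, minority_class_size = 22805, 1418


def pct(part, whole):
    """Percentage of `part` in `whole`, rounded to two decimals (assumed display precision)."""
    return round(100.0 * part / whole, 2)


derived = {
    "pct_instances_with_missing": pct(n_rows_with_missing, n_rows),
    # Missing cells are counted against all cells: rows times columns.
    "pct_missing_values": pct(n_missing_values, n_rows * n_cols),
    "pct_numeric_attributes": pct(n_numeric, n_cols),
    "pct_nominal_attributes": pct(n_nominal, n_cols),
    "pct_majority_class": pct(majority_class_size, n_rows),
    "pct_minority_class": pct(minority_class_size, n_rows),
    # Attributes per instance; the page shows this rounded to 0.
    "dimensionality": round(n_cols / n_rows, 2),
}

for name, value in derived.items():
    print(f"{name:30s} {value}")
```

Note the different denominators: the percentage of instances with missing values is relative to rows, while the percentage of missing values is relative to all cells, which is why the two figures differ so widely.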

1 task

0 runs. estimation_procedure: 10-fold cross-validation. target_feature: action_taken
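
To illustrate what a 10-fold cross-validation estimation procedure does with 43965 rows, here is a minimal standard-library sketch. It is not the split OpenML uses for this task: the actual folds are fixed by the task definition and may be stratified by `action_taken`, whereas this version is a plain shuffled, unstratified split. The function name `k_fold_indices` and the `seed` parameter are hypothetical.

```python
import random


def k_fold_indices(n_rows, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs for a shuffled, unstratified k-fold split."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into k folds whose sizes differ by at most one.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test_idx = folds[i]
        train_idx = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train_idx, test_idx


splits = list(k_fold_indices(43965, k=10))
for fold_no, (train_idx, test_idx) in enumerate(splits):
    print(f"fold {fold_no}: train={len(train_idx)} test={len(test_idx)}")
```

Each row appears in exactly one test fold, so every instance is used for evaluation once and for training nine times.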