EDA-Home-Mortgage-NY

Status: active. Format: ARFF. Visibility: public. Uploaded 04-12-2024 by B Gun.
This dataset was uploaded from Kaggle. Source: https://www.kaggle.com/code/ambarish/eda-home-mortgage-ny-with-feature-analysis/script

30 features

action_taken (target): nominal, 7 unique values, 0 missing
agency_code: numeric, 6 unique values, 0 missing
applicant_ethnicity: numeric, 4 unique values, 0 missing
applicant_income_000s: numeric, 1459 unique values, 12344 missing
applicant_race_1: numeric, 7 unique values, 0 missing
applicant_sex: numeric, 4 unique values, 0 missing
application_date_indicator: numeric, 2 unique values, 0 missing
census_tract_number: numeric, 2625 unique values, 351 missing
co_applicant_ethnicity: numeric, 5 unique values, 0 missing
co_applicant_race_1: numeric, 8 unique values, 0 missing
co_applicant_sex: numeric, 5 unique values, 0 missing
county_code: numeric, 62 unique values, 247 missing
denial_reason_1: numeric, 9 unique values, 76281 missing
hoepa_status: numeric, 2 unique values, 0 missing
lien_status: numeric, 4 unique values, 0 missing
loan_purpose: numeric, 3 unique values, 0 missing
loan_type: numeric, 4 unique values, 0 missing
msamd: numeric, 14 unique values, 7170 missing
owner_occupancy: numeric, 3 unique values, 0 missing
preapproval: numeric, 3 unique values, 0 missing
property_type: numeric, 3 unique values, 0 missing
purchaser_type: numeric, 10 unique values, 0 missing
sequence_number: numeric, 49497 unique values, 0 missing
hud_median_family_income: numeric, 15 unique values, 351 missing
loan_amount_000s: numeric, 2000 unique values, 0 missing
number_of_1_to_4_family_units: numeric, 2156 unique values, 459 missing
number_of_owner_occupied_units: numeric, 1842 unique values, 418 missing
minority_population: numeric, 3307 unique values, 355 missing
population: numeric, 3304 unique values, 355 missing
tract_to_msamd_income: numeric, 3904 unique values, 373 missing
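
The missing-value counts in the feature list are concentrated in a few columns. The following is a minimal Python sketch, not part of the original page, that tallies the per-feature counts copied from the list and ranks the features by their share of all missing cells. The variable names are illustrative, and features that report 0 missing are simply left out.

```python
# Missing-value counts copied from the feature list;
# every feature not listed here reports 0 missing.
missing_by_feature = {
    "applicant_income_000s": 12344,
    "census_tract_number": 351,
    "county_code": 247,
    "denial_reason_1": 76281,
    "msamd": 7170,
    "hud_median_family_income": 351,
    "number_of_1_to_4_family_units": 459,
    "number_of_owner_occupied_units": 418,
    "minority_population": 355,
    "population": 355,
    "tract_to_msamd_income": 373,
}

total_missing = sum(missing_by_feature.values())

# Rank features by how many missing cells they contribute, largest first.
ranked = sorted(missing_by_feature.items(), key=lambda kv: kv[1], reverse=True)
for name, count in ranked:
    share = 100 * count / total_missing
    print(f"{name:32s} {count:6d} {share:6.2f}%")
print("total missing:", total_missing)
```

The total should agree with the dataset-level number of missing values reported in the properties. Most of the missing cells come from denial_reason_1, which is plausibly empty for applications that were not denied, although the page itself does not say so.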

19 properties

Number of instances (rows) of the dataset: 87931
Number of attributes (columns) of the dataset: 30
Number of distinct values of the target attribute (if it is nominal): 7
Number of missing values in the dataset: 98704
Number of instances with at least one value missing: 78172
Number of numeric attributes: 29
Number of nominal attributes: 1
Percentage of binary attributes: 0
Percentage of instances having missing values: 88.9
Percentage of missing values: 3.74
Average class difference between consecutive instances: 1
Percentage of numeric attributes: 96.67
Number of attributes divided by the number of instances: 0
Percentage of nominal attributes: 3.33
Percentage of instances belonging to the most frequent class: 51.87
Number of instances belonging to the most frequent class: 45611
Percentage of instances belonging to the least frequent class: 0
Number of instances belonging to the least frequent class: 1
Number of binary attributes: 0
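
Several of these properties are ratios of the raw counts listed alongside them. As a rough check, the sketch below recomputes them in Python from the reported counts. It assumes that the percentage of missing values uses rows times columns as its denominator; the page does not state this, but the assumption reproduces the reported figure. All variable names are illustrative.

```python
# Raw counts as reported in the properties list.
n_instances = 87931
n_attributes = 30
n_missing_values = 98704
n_instances_with_missing = 78172
n_numeric = 29
n_nominal = 1
majority_class_size = 45611
minority_class_size = 1


def pct(part, whole):
    """Return part as a percentage of whole."""
    return 100 * part / whole


derived = {
    "pct_instances_with_missing": pct(n_instances_with_missing, n_instances),
    # Assumption: the denominator is the total number of cells (rows * columns).
    "pct_missing_values": pct(n_missing_values, n_instances * n_attributes),
    "pct_numeric_attributes": pct(n_numeric, n_attributes),
    "pct_nominal_attributes": pct(n_nominal, n_attributes),
    "pct_majority_class": pct(majority_class_size, n_instances),
    "pct_minority_class": pct(minority_class_size, n_instances),
    "attributes_per_instance": n_attributes / n_instances,
}

for name, value in derived.items():
    print(f"{name}: {value:.4f}")
```

Printed to four decimals, the values that the page shows as 0 (the least frequent class percentage and the attributes-per-instance ratio) come out small but nonzero, which suggests the page rounds them to two decimals.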

1 task

0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: action_taken
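
To illustrate what a 10-fold estimation procedure means for a dataset of this size, here is a minimal Python sketch that partitions 87931 row indices into ten contiguous test folds. It is a plain, unshuffled and unstratified split written for illustration; the task's actual splits may be shuffled or stratified, and the helper name k_fold_indices is hypothetical.

```python
def k_fold_indices(n_instances, k=10):
    """Yield (train_indices, test_indices) for plain, unshuffled k-fold CV."""
    base, extra = divmod(n_instances, k)
    start = 0
    for fold in range(k):
        # The first `extra` folds get one additional instance each.
        size = base + (1 if fold < extra else 0)
        test = range(start, start + size)
        train = list(range(0, start)) + list(range(start + size, n_instances))
        yield train, test
        start += size


folds = list(k_fold_indices(87931, k=10))
fold_sizes = [len(test) for _, test in folds]
print(fold_sizes)
```

Each instance lands in exactly one test fold. Because the least frequent class of action_taken has a single instance, that class can appear in only one test fold, and the model evaluated on that fold never sees it during training.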