Data
TVS_Loan_Default

TVS_Loan_Default

active ARFF CC0: Public Domain Visibility: public Uploaded 24-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Personal Loan product is an unsecured loan therefore it is vital to assess the risk of the customers by checking their credit worthiness. This must be done to prevent loan defaults. The objective is to build a Risk model using the dataset which will assess the risk of a customer defaulting after cross-selling the Personal Loan. Column Descriptions: V1: Customer ID V2: If a customer has bounced in first EMI (1 : Bounced, 0 : Not bounced) V3: Number of times bounced in recent 12 months V4: Maximum MOB (Month of business with TVS Credit) V5: Number of times bounced while repaying the loan V6: EMI V7: Loan Amount V8: Tenure V9: Dealer codes from where customer has purchased the Two wheeler V10: Product code of Two wheeler (MC : Motorcycle , MO : Moped, SC : Scooter) V11: No of advance EMI paid V12: Rate of interest V13: Gender (Male/Female) V14: Employment type (HOUSEWIFE : housewife, SELF : Self-employed, SAL : Salaried, PENS : Pensioner, STUDENT : Student) V15: Resident type of customer V16: Date of birth V17: Age at which customer has taken the loan V18: Number of loans V19: Number of secured loans V20: Number of unsecured loans V21: Maximum amount sanctioned in the Live loans V22: Number of new loans in last 3 months V23: Total sanctioned amount in the secured Loans which are Live V24: Total sanctioned amount in the unsecured Loans which are Live V25: Maximum amount sanctioned for any Two wheeler loan V26: Time since last Personal loan taken (in months) V27: Time since first consumer durables loan taken (in months) V28: Number of times 30 days past due in last 6 months V29: Number of times 60 days past due in last 6 months V30: Number of times 90 days past due in last 3 months V31: Tier ; (Customers geographical location) V32: Target variable ( 1: Defaulters / 0: Non-Defaulters)

32 features

V1numeric119528 unique values
0 missing
V2numeric2 unique values
0 missing
V3numeric13 unique values
0 missing
V4numeric34 unique values
34480 missing
V5numeric24 unique values
34480 missing
V6numeric3292 unique values
34480 missing
V7numeric6289 unique values
34480 missing
V8numeric31 unique values
34480 missing
V9numeric3250 unique values
34480 missing
V10string5 unique values
34480 missing
V11numeric7 unique values
34480 missing
V12numeric1024 unique values
34480 missing
V13string2 unique values
34480 missing
V14string5 unique values
34480 missing
V15string3 unique values
35397 missing
V16string13486 unique values
34480 missing
V17numeric49 unique values
34480 missing
V18numeric121 unique values
0 missing
V19numeric109 unique values
0 missing
V20numeric47 unique values
0 missing
V21numeric9900 unique values
82902 missing
V22numeric1 unique values
0 missing
V23numeric7272 unique values
100247 missing
V24numeric10414 unique values
100500 missing
V25numeric8681 unique values
15061 missing
V26numeric224 unique values
106097 missing
V27numeric246 unique values
99095 missing
V28numeric86 unique values
0 missing
V29numeric81 unique values
0 missing
V30numeric46 unique values
0 missing
V31string4 unique values
0 missing
V32numeric2 unique values
0 missing

19 properties

119528
Number of instances (rows) of the dataset.
32
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
987539
Number of missing values in the dataset.
117856
Number of instances with at least one value missing.
26
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
81.25
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
98.6
Percentage of instances having missing values.
Average class difference between consecutive instances.
25.82
Percentage of missing values.

0 tasks

Define a new task