Data
Credit-Card-Dataset-for-Clustering

Credit-Card-Dataset-for-Clustering

active ARFF CC0: Public Domain Visibility: public Uploaded 24-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
This case requires to develop a customer segmentation to define marketing strategy. The sample Dataset summarizes the usage behavior of about 9000 active credit card holders during the last 6 months. The file is at a customer level with 18 behavioral variables. Following is the Data Dictionary for Credit Card dataset :- CUSTID : Identification of Credit Card holder (Categorical) BALANCE : Balance amount left in their account to make purchases ( BALANCEFREQUENCY : How frequently the Balance is updated, score between 0 and 1 (1 = frequently updated, 0 = not frequently updated) PURCHASES : Amount of purchases made from account ONEOFFPURCHASES : Maximum purchase amount done in one-go INSTALLMENTSPURCHASES : Amount of purchase done in installment CASHADVANCE : Cash in advance given by the user PURCHASESFREQUENCY : How frequently the Purchases are being made, score between 0 and 1 (1 = frequently purchased, 0 = not frequently purchased) ONEOFFPURCHASESFREQUENCY : How frequently Purchases are happening in one-go (1 = frequently purchased, 0 = not frequently purchased) PURCHASESINSTALLMENTSFREQUENCY : How frequently purchases in installments are being done (1 = frequently done, 0 = not frequently done) CASHADVANCEFREQUENCY : How frequently the cash in advance being paid CASHADVANCETRX : Number of Transactions made with "Cash in Advanced" PURCHASESTRX : Numbe of purchase transactions made CREDITLIMIT : Limit of Credit Card for user PAYMENTS : Amount of Payment done by user MINIMUM_PAYMENTS : Minimum amount of payments made by user PRCFULLPAYMENT : Percent of full payment paid by user TENURE : Tenure of credit card service for user

17 features

CUST_ID (ignore)string8950 unique values
0 missing
BALANCEnumeric8871 unique values
0 missing
BALANCE_FREQUENCYnumeric43 unique values
0 missing
PURCHASESnumeric6203 unique values
0 missing
ONEOFF_PURCHASESnumeric4014 unique values
0 missing
INSTALLMENTS_PURCHASESnumeric4452 unique values
0 missing
CASH_ADVANCEnumeric4323 unique values
0 missing
PURCHASES_FREQUENCYnumeric47 unique values
0 missing
ONEOFF_PURCHASES_FREQUENCYnumeric47 unique values
0 missing
PURCHASES_INSTALLMENTS_FREQUENCYnumeric47 unique values
0 missing
CASH_ADVANCE_FREQUENCYnumeric54 unique values
0 missing
CASH_ADVANCE_TRXnumeric65 unique values
0 missing
PURCHASES_TRXnumeric173 unique values
0 missing
CREDIT_LIMITnumeric205 unique values
1 missing
PAYMENTSnumeric8711 unique values
0 missing
MINIMUM_PAYMENTSnumeric8636 unique values
313 missing
PRC_FULL_PAYMENTnumeric47 unique values
0 missing
TENUREnumeric7 unique values
0 missing

19 properties

8950
Number of instances (rows) of the dataset.
17
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
314
Number of missing values in the dataset.
314
Number of instances with at least one value missing.
17
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
100
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
3.51
Percentage of instances having missing values.
Average class difference between consecutive instances.
0.21
Percentage of missing values.

0 tasks

Define a new task