KDD98

Status: active. Format: ARFF. Visibility: public. Uploaded 03-04-2020 by Florian Pargent.


Source: KDD98 challenge, https://kdd.ics.uci.edu/databases/kddcup98/kddcup98.html

The goal of the challenge is to estimate the return from a direct mailing in order to maximize donation profits. This version of the dataset poses a binary classification problem: whether or not there was a response to the mailing. The target is encoded as a binary factor. The features 'HPHONE_D', 'MHUC2', 'INCOME', 'WEALTH1', and 'WEALTH2' were recoded as nominal factor variables, and the constant feature 'RFA_2R' was removed from the dataset. The majority class was downsampled to 40% of its original size, and unused factor levels were dropped.
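
The preprocessing described above can be illustrated on toy data. The following is only a sketch under stated assumptions, not the uploader's actual code: the function name `preprocess`, its parameters, the fixed seed, and the toy records are all hypothetical. Dropping unused factor levels has no direct counterpart for plain Python strings, so that step is omitted.

```python
import random

def preprocess(rows, target="TARGET_B", nominal=("INCOME",),
               majority_fraction=0.4, seed=0):
    """Toy re-creation of the described steps on a list of dict records."""
    # Remove constant features (a single distinct value across all rows).
    columns = [c for c in rows[0] if c != target]
    constant = [c for c in columns if len({r[c] for r in rows}) == 1]
    rows = [{k: v for k, v in r.items() if k not in constant} for r in rows]

    for r in rows:
        # Encode the target as a two-level label.
        r[target] = "1" if str(r[target]) == "1" else "0"
        # Recode selected numeric codes as nominal (string) categories.
        for c in nominal:
            if c in r and r[c] is not None:
                r[c] = str(r[c])

    # Downsample the majority class to a fraction of its original size.
    groups = {"0": [], "1": []}
    for r in rows:
        groups[r[target]].append(r)
    major, minor = sorted(groups, key=lambda k: len(groups[k]), reverse=True)
    keep = round(len(groups[major]) * majority_fraction)
    sampled = random.Random(seed).sample(groups[major], keep)
    return sampled + groups[minor], constant

# Hypothetical toy data: 10 responders, 90 non-responders, one constant column.
toy = [{"TARGET_B": int(i % 10 == 0), "RFA_2R": "L", "INCOME": i % 7}
       for i in range(100)]
result, dropped = preprocess(toy)
print(len(result), dropped)  # 46 ['RFA_2R']
```

On the toy data, 36 of the 90 majority rows are kept alongside all 10 minority rows, and the constant column is reported as dropped.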

478 features

TARGET_B (target): nominal, 2 unique values, 0 missing
AC1: numeric, 37 unique values, 0 missing
LASTGIFT: numeric, 220 unique values, 0 missing
AC2: numeric, 40 unique values, 0 missing
LFC1: numeric, 99 unique values, 0 missing
ADATE_10: nominal, 3 unique values, 0 missing
LFC10: numeric, 89 unique values, 0 missing
ADATE_11: nominal, 5 unique values, 0 missing
LFC2: numeric, 100 unique values, 0 missing
ADATE_12: nominal, 5 unique values, 0 missing
LFC3: numeric, 98 unique values, 0 missing
ADATE_13: nominal, 4 unique values, 0 missing
LFC4: numeric, 100 unique values, 0 missing
ADATE_14: nominal, 3 unique values, 0 missing
LFC5: numeric, 98 unique values, 0 missing
ADATE_15: nominal, 2 unique values, 0 missing
LFC6: numeric, 96 unique values, 0 missing
ADATE_16: nominal, 4 unique values, 0 missing
LFC7: numeric, 99 unique values, 0 missing
ADATE_17: nominal, 4 unique values, 0 missing
LFC8: numeric, 99 unique values, 0 missing
ADATE_18: nominal, 10 unique values, 0 missing
LFC9: numeric, 96 unique values, 0 missing
ADATE_19: nominal, 4 unique values, 0 missing
LIFESRC: nominal, 4 unique values, 0 missing
ADATE_2: nominal, 2 unique values, 0 missing
LOCALGOV: numeric, 56 unique values, 0 missing
ADATE_20: nominal, 3 unique values, 0 missing
LSC1: numeric, 100 unique values, 0 missing
ADATE_21: nominal, 3 unique values, 0 missing
LSC2: numeric, 100 unique values, 0 missing
ADATE_22: nominal, 6 unique values, 0 missing
LSC3: numeric, 73 unique values, 0 missing
ADATE_23: nominal, 4 unique values, 0 missing
LSC4: numeric, 76 unique values, 0 missing
ADATE_24: nominal, 3 unique values, 0 missing
MAGFAML: numeric, 9 unique values, 45452 missing
ADATE_3: nominal, 3 unique values, 0 missing
MAGFEM: numeric, 5 unique values, 45452 missing
ADATE_4: nominal, 9 unique values, 0 missing
MAGMALE: numeric, 5 unique values, 45452 missing
ADATE_5: nominal, 2 unique values, 0 missing
MAILCODE: nominal, 2 unique values, 0 missing
ADATE_6: nominal, 3 unique values, 0 missing
MAJOR: nominal, 2 unique values, 0 missing
ADATE_7: nominal, 4 unique values, 0 missing
MALEMILI: numeric, 94 unique values, 0 missing
ADATE_8: nominal, 6 unique values, 0 missing
MALEVET: numeric, 88 unique values, 0 missing
ADATE_9: nominal, 4 unique values, 0 missing
MARR1: numeric, 94 unique values, 0 missing
ADI: nominal, 202 unique values, 0 missing
MARR2: numeric, 50 unique values, 0 missing
AFC1: numeric, 83 unique values, 0 missing
MARR3: numeric, 63 unique values, 0 missing
AFC2: numeric, 93 unique values, 0 missing
MARR4: numeric, 100 unique values, 0 missing
AFC3: numeric, 39 unique values, 0 missing
MAXADATE: nominal, 6 unique values, 0 missing
AFC4: numeric, 61 unique values, 0 missing
MAXRAMNT: numeric, 275 unique values, 0 missing
AFC5: numeric, 88 unique values, 0 missing
MAXRDATE: nominal, 143 unique values, 0 missing
AFC6: numeric, 29 unique values, 0 missing
MBBOOKS: numeric, 10 unique values, 45452 missing
AGE: numeric, 92 unique values, 20371 missing
MBCOLECT: numeric, 7 unique values, 45499 missing
AGE901: numeric, 72 unique values, 0 missing
MBCRAFT: numeric, 7 unique values, 45452 missing
AGE902: numeric, 66 unique values, 0 missing
MBGARDEN: numeric, 4 unique values, 45452 missing
AGE903: numeric, 58 unique values, 0 missing
MC1: numeric, 98 unique values, 0 missing
AGE904: numeric, 65 unique values, 0 missing
MC2: numeric, 99 unique values, 0 missing
AGE905: numeric, 60 unique values, 0 missing
MC3: numeric, 96 unique values, 0 missing
AGE906: numeric, 53 unique values, 0 missing
MDMAUD: nominal, 26 unique values, 0 missing
AGE907: numeric, 60 unique values, 0 missing
MDMAUD_A: nominal, 5 unique values, 0 missing
AGEC1: numeric, 98 unique values, 0 missing
MDMAUD_F: nominal, 4 unique values, 0 missing
AGEC2: numeric, 70 unique values, 0 missing
MDMAUD_R: nominal, 5 unique values, 0 missing
AGEC3: numeric, 56 unique values, 0 missing
MHUC1: numeric, 22 unique values, 0 missing
AGEC4: numeric, 39 unique values, 0 missing
MHUC2: nominal, 6 unique values, 0 missing
AGEC5: numeric, 41 unique values, 0 missing
MINRAMNT: numeric, 193 unique values, 0 missing
AGEC6: numeric, 62 unique values, 0 missing
MINRDATE: nominal, 145 unique values, 0 missing
AGEC7: numeric, 75 unique values, 0 missing
MSA: nominal, 297 unique values, 0 missing
AGEFLAG: nominal, 3 unique values, 0 missing
NEXTDATE: nominal, 178 unique values, 0 missing
ANC1: numeric, 65 unique values, 0 missing
NGIFTALL: numeric, 84 unique values, 0 missing
ANC10: numeric, 62 unique values, 0 missing
NOEXCH: nominal, 4 unique values, 0 missing
ANC11: numeric, 30 unique values, 0 missing
NUMCHLD: numeric, 7 unique values, 71727 missing
ANC12: numeric, 29 unique values, 0 missing
NUMPRM12: numeric, 61 unique values, 0 missing
ANC13: numeric, 19 unique values, 0 missing
NUMPROM: numeric, 159 unique values, 0 missing
ANC14: numeric, 25 unique values, 0 missing
OCC1: numeric, 70 unique values, 0 missing
ANC15: numeric, 18 unique values, 0 missing
OCC10: numeric, 57 unique values, 0 missing
ANC2: numeric, 55 unique values, 0 missing
OCC11: numeric, 57 unique values, 0 missing
ANC3: numeric, 28 unique values, 0 missing
OCC12: numeric, 44 unique values, 0 missing
ANC4: numeric, 81 unique values, 0 missing
OCC13: numeric, 45 unique values, 0 missing
ANC5: numeric, 21 unique values, 0 missing
OCC2: numeric, 61 unique values, 0 missing
ANC6: numeric, 14 unique values, 0 missing
OCC3: numeric, 35 unique values, 0 missing
ANC7: numeric, 43 unique values, 0 missing
OCC4: numeric, 59 unique values, 0 missing
ANC8: numeric, 42 unique values, 0 missing
OCC5: numeric, 60 unique values, 0 missing
ANC9: numeric, 47 unique values, 0 missing
OCC6: numeric, 27 unique values, 0 missing
AVGGIFT: numeric, 7300 unique values, 0 missing
OCC7: numeric, 38 unique values, 0 missing
BIBLE: nominal, 2 unique values, 0 missing
OCC8: numeric, 67 unique values, 0 missing
BOATS: nominal, 2 unique values, 0 missing
OCC9: numeric, 69 unique values, 0 missing
CARDGIFT: numeric, 35 unique values, 0 missing
ODATEDW: nominal, 52 unique values, 0 missing
CARDPM12: numeric, 22 unique values, 0 missing
OEDC1: numeric, 55 unique values, 0 missing
CARDPROM: numeric, 59 unique values, 0 missing
OEDC2: numeric, 62 unique values, 0 missing
CARDS: nominal, 2 unique values, 0 missing
OEDC3: numeric, 56 unique values, 0 missing
CATLG: nominal, 2 unique values, 0 missing
OEDC4: numeric, 63 unique values, 0 missing
CDPLAY: nominal, 2 unique values, 0 missing
OEDC5: numeric, 90 unique values, 0 missing
CHIL1: numeric, 83 unique values, 0 missing
OEDC6: numeric, 60 unique values, 0 missing
CHIL2: numeric, 69 unique values, 0 missing
OEDC7: numeric, 23 unique values, 0 missing
CHIL3: numeric, 77 unique values, 0 missing
OSOURCE: nominal, 888 unique values, 0 missing
CHILC1: numeric, 64 unique values, 0 missing
PCOWNERS: nominal, 2 unique values, 0 missing
CHILC2: numeric, 47 unique values, 0 missing
PEC1: numeric, 83 unique values, 0 missing
CHILC3: numeric, 62 unique values, 0 missing
PEC2: numeric, 99 unique values, 0 missing
CHILC4: numeric, 53 unique values, 0 missing
PEPSTRFL: nominal, 2 unique values, 0 missing
CHILC5: numeric, 97 unique values, 0 missing
PETS: nominal, 2 unique values, 0 missing
CHILD03: nominal, 4 unique values, 0 missing
PHOTO: nominal, 2 unique values, 0 missing
CHILD07: nominal, 4 unique values, 0 missing
PLATES: nominal, 2 unique values, 0 missing
CHILD12: nominal, 4 unique values, 0 missing
POBC1: numeric, 89 unique values, 0 missing
CHILD18: nominal, 4 unique values, 0 missing
POBC2: numeric, 100 unique values, 0 missing
CLUSTER: nominal, 54 unique values, 0 missing
POP901: numeric, 9492 unique values, 0 missing
CLUSTER2: numeric, 62 unique values, 0 missing
POP902: numeric, 4585 unique values, 0 missing
COLLECT1: nominal, 2 unique values, 0 missing
POP903: numeric, 5474 unique values, 0 missing
CRAFTS: nominal, 2 unique values, 0 missing
POP90C1: numeric, 100 unique values, 0 missing
DATASRCE: nominal, 4 unique values, 0 missing
POP90C2: numeric, 100 unique values, 0 missing
DMA: nominal, 204 unique values, 0 missing
POP90C3: numeric, 100 unique values, 0 missing
DOB: nominal, 924 unique values, 0 missing
POP90C4: numeric, 81 unique values, 0 missing
DOMAIN: nominal, 17 unique values, 0 missing
POP90C5: numeric, 81 unique values, 0 missing
DW1: numeric, 100 unique values, 0 missing
PUBCULIN: numeric, 6 unique values, 45452 missing
DW2: numeric, 100 unique values, 0 missing
PUBDOITY: numeric, 10 unique values, 45452 missing
DW3: numeric, 68 unique values, 0 missing
PUBGARDN: numeric, 7 unique values, 45452 missing
DW4: numeric, 100 unique values, 0 missing
PUBHLTH: numeric, 10 unique values, 45452 missing
DW5: numeric, 100 unique values, 0 missing
PUBNEWFN: numeric, 10 unique values, 45452 missing
DW6: numeric, 100 unique values, 0 missing
PUBOPP: numeric, 10 unique values, 45452 missing
DW7: numeric, 97 unique values, 0 missing
PUBPHOTO: numeric, 3 unique values, 45452 missing
DW8: numeric, 88 unique values, 0 missing
PVASTATE: nominal, 3 unique values, 0 missing
DW9: numeric, 90 unique values, 0 missing
RAMNTALL: numeric, 1989 unique values, 0 missing
EC1: numeric, 79 unique values, 0 missing
RAMNT_10: numeric, 85 unique values, 73394 missing
EC2: numeric, 75 unique values, 0 missing
RAMNT_11: numeric, 90 unique values, 69428 missing
EC3: numeric, 62 unique values, 0 missing
RAMNT_12: numeric, 104 unique values, 59875 missing
EC4: numeric, 78 unique values, 0 missing
RAMNT_13: numeric, 79 unique values, 71421 missing
EC5: numeric, 62 unique values, 0 missing
RAMNT_14: numeric, 96 unique values, 61686 missing
EC6: numeric, 37 unique values, 0 missing
RAMNT_15: numeric, 66 unique values, 75994 missing
EC7: numeric, 63 unique values, 0 missing
RAMNT_16: numeric, 98 unique values, 58778 missing
EC8: numeric, 72 unique values, 0 missing
RAMNT_17: numeric, 67 unique values, 74366 missing
EIC1: numeric, 75 unique values, 0 missing
RAMNT_18: numeric, 94 unique values, 65050 missing
EIC10: numeric, 44 unique values, 0 missing
RAMNT_19: numeric, 77 unique values, 68547 missing
EIC11: numeric, 50 unique values, 0 missing
RAMNT_20: numeric, 62 unique values, 75472 missing
EIC12: numeric, 39 unique values, 0 missing
RAMNT_21: numeric, 75 unique values, 74008 missing
EIC13: numeric, 57 unique values, 0 missing
RAMNT_22: numeric, 86 unique values, 63978 missing
EIC14: numeric, 65 unique values, 0 missing
RAMNT_23: numeric, 68 unique values, 75473 missing
EIC15: numeric, 55 unique values, 0 missing
RAMNT_24: numeric, 83 unique values, 66702 missing
EIC16: numeric, 51 unique values, 0 missing
RAMNT_3: numeric, 28 unique values, 82090 missing
EIC2: numeric, 51 unique values, 0 missing
RAMNT_4: numeric, 32 unique values, 82049 missing
EIC3: numeric, 51 unique values, 0 missing
RAMNT_5: numeric, 8 unique values, 82304 missing
EIC4: numeric, 74 unique values, 0 missing
RAMNT_6: numeric, 37 unique values, 81608 missing
EIC5: numeric, 40 unique values, 0 missing
RAMNT_7: numeric, 76 unique values, 74721 missing
EIC6: numeric, 33 unique values, 0 missing
RAMNT_8: numeric, 103 unique values, 63433 missing
EIC7: numeric, 44 unique values, 0 missing
RAMNT_9: numeric, 82 unique values, 67534 missing
EIC8: numeric, 71 unique values, 0 missing
RDATE_10: nominal, 10 unique values, 0 missing
EIC9: numeric, 51 unique values, 0 missing
RDATE_11: nominal, 13 unique values, 0 missing
ETH1: numeric, 100 unique values, 0 missing
RDATE_12: nominal, 10 unique values, 0 missing
ETH10: numeric, 33 unique values, 0 missing
RDATE_13: nominal, 15 unique values, 0 missing
ETH11: numeric, 34 unique values, 0 missing
RDATE_14: nominal, 13 unique values, 0 missing
ETH12: numeric, 42 unique values, 0 missing
RDATE_15: nominal, 17 unique values, 0 missing
ETH13: numeric, 97 unique values, 0 missing
RDATE_16: nominal, 18 unique values, 0 missing
ETH14: numeric, 45 unique values, 0 missing
RDATE_17: nominal, 12 unique values, 0 missing
ETH15: numeric, 80 unique values, 0 missing
RDATE_18: nominal, 16 unique values, 0 missing
ETH16: numeric, 62 unique values, 0 missing
RDATE_19: nominal, 14 unique values, 0 missing
ETH2: numeric, 100 unique values, 0 missing
RDATE_20: nominal, 11 unique values, 0 missing
ETH3: numeric, 84 unique values, 0 missing
RDATE_21: nominal, 13 unique values, 0 missing
ETH4: numeric, 94 unique values, 0 missing
RDATE_22: nominal, 13 unique values, 0 missing
ETH5: numeric, 100 unique values, 0 missing
RDATE_23: nominal, 18 unique values, 0 missing
ETH6: numeric, 22 unique values, 0 missing
RDATE_24: nominal, 15 unique values, 0 missing
ETH7: numeric, 59 unique values, 0 missing
RDATE_3: nominal, 16 unique values, 0 missing
ETH8: numeric, 60 unique values, 0 missing
RDATE_4: nominal, 24 unique values, 0 missing
ETH9: numeric, 56 unique values, 0 missing
RDATE_5: nominal, 7 unique values, 0 missing
ETHC1: numeric, 52 unique values, 0 missing
RDATE_6: nominal, 18 unique values, 0 missing
ETHC2: numeric, 95 unique values, 0 missing
RDATE_7: nominal, 11 unique values, 0 missing
ETHC3: numeric, 99 unique values, 0 missing
RDATE_8: nominal, 16 unique values, 0 missing
ETHC4: numeric, 45 unique values, 0 missing
RDATE_9: nominal, 11 unique values, 0 missing
ETHC5: numeric, 81 unique values, 0 missing
RECINHSE: nominal, 2 unique values, 0 missing
ETHC6: numeric, 48 unique values, 0 missing
RECP3: nominal, 2 unique values, 0 missing
FEDGOV: numeric, 57 unique values, 0 missing
RECPGVG: nominal, 2 unique values, 0 missing
FISHER: nominal, 2 unique values, 0 missing
RECSWEEP: nominal, 2 unique values, 0 missing
FISTDATE: nominal, 173 unique values, 0 missing
RFA_10: nominal, 93 unique values, 0 missing
GARDENIN: nominal, 2 unique values, 0 missing
RFA_11: nominal, 102 unique values, 0 missing
GENDER: nominal, 6 unique values, 0 missing
RFA_12: nominal, 107 unique values, 0 missing
GEOCODE: nominal, 8 unique values, 0 missing
RFA_13: nominal, 85 unique values, 0 missing
GEOCODE2: nominal, 5 unique values, 0 missing
RFA_14: nominal, 99 unique values, 0 missing
HC1: numeric, 32 unique values, 0 missing
RFA_15: nominal, 35 unique values, 0 missing
HC10: numeric, 51 unique values, 0 missing
RFA_16: nominal, 122 unique values, 0 missing
HC11: numeric, 100 unique values, 0 missing
RFA_17: nominal, 117 unique values, 0 missing
HC12: numeric, 95 unique values, 0 missing
RFA_18: nominal, 122 unique values, 0 missing
HC13: numeric, 100 unique values, 0 missing
RFA_19: nominal, 107 unique values, 0 missing
HC14: numeric, 98 unique values, 0 missing
RFA_2: nominal, 14 unique values, 0 missing
HC15: numeric, 16 unique values, 0 missing
RFA_20: nominal, 80 unique values, 0 missing
HC16: numeric, 93 unique values, 0 missing
RFA_21: nominal, 102 unique values, 0 missing
HC17: numeric, 100 unique values, 0 missing
RFA_22: nominal, 116 unique values, 0 missing
HC18: numeric, 100 unique values, 0 missing
RFA_23: nominal, 85 unique values, 0 missing
HC19: numeric, 100 unique values, 0 missing
RFA_24: nominal, 97 unique values, 0 missing
HC2: numeric, 53 unique values, 0 missing
RFA_2A: nominal, 4 unique values, 0 missing
HC20: numeric, 57 unique values, 0 missing
RFA_2F: nominal, 4 unique values, 0 missing
HC21: numeric, 71 unique values, 0 missing
RFA_3: nominal, 70 unique values, 0 missing
HC3: numeric, 84 unique values, 0 missing
RFA_4: nominal, 65 unique values, 0 missing
HC4: numeric, 100 unique values, 0 missing
RFA_5: nominal, 41 unique values, 0 missing
HC5: numeric, 100 unique values, 0 missing
RFA_6: nominal, 110 unique values, 0 missing
HC6: numeric, 100 unique values, 0 missing
RFA_7: nominal, 103 unique values, 0 missing
HC7: numeric, 100 unique values, 0 missing
RFA_8: nominal, 107 unique values, 0 missing
HC8: numeric, 100 unique values, 0 missing
RFA_9: nominal, 106 unique values, 0 missing
HC9: numeric, 89 unique values, 0 missing
RHP1: numeric, 80 unique values, 0 missing
HHAGE1: numeric, 99 unique values, 0 missing
RHP2: numeric, 80 unique values, 0 missing
HHAGE2: numeric, 86 unique values, 0 missing
RHP3: numeric, 26 unique values, 0 missing
HHAGE3: numeric, 100 unique values, 0 missing
RHP4: numeric, 23 unique values, 0 missing
HHAS1: numeric, 100 unique values, 0 missing
RP1: numeric, 100 unique values, 0 missing
HHAS2: numeric, 72 unique values, 0 missing
RP2: numeric, 100 unique values, 0 missing
HHAS3: numeric, 100 unique values, 0 missing
RP3: numeric, 100 unique values, 0 missing
HHAS4: numeric, 90 unique values, 0 missing
RP4: numeric, 100 unique values, 0 missing
HHD1: numeric, 95 unique values, 0 missing
SEC1: numeric, 82 unique values, 0 missing
HHD10: numeric, 80 unique values, 0 missing
SEC2: numeric, 95 unique values, 0 missing
HHD11: numeric, 81 unique values, 0 missing
SEC3: numeric, 18 unique values, 0 missing
HHD12: numeric, 63 unique values, 0 missing
SEC4: numeric, 54 unique values, 0 missing
HHD2: numeric, 100 unique values, 0 missing
SEC5: numeric, 97 unique values, 0 missing
HHD3: numeric, 100 unique values, 0 missing
SOLIH: nominal, 9 unique values, 0 missing
HHD4: numeric, 92 unique values, 0 missing
SOLP3: nominal, 5 unique values, 0 missing
HHD5: numeric, 100 unique values, 0 missing
STATE: nominal, 54 unique values, 0 missing
HHD6: numeric, 100 unique values, 0 missing
STATEGOV: numeric, 62 unique values, 0 missing
HHD7: numeric, 64 unique values, 0 missing
STEREO: nominal, 2 unique values, 0 missing
HHD8: numeric, 18 unique values, 0 missing
HHD9: numeric, 62 unique values, 0 missing
TCODE: nominal, 50 unique values, 0 missing
HHN1: numeric, 97 unique values, 0 missing
TIMELAG: numeric, 67 unique values, 8339 missing
HHN2: numeric, 83 unique values, 0 missing
TPE1: numeric, 99 unique values, 0 missing
HHN3: numeric, 97 unique values, 0 missing
TPE10: numeric, 64 unique values, 0 missing
HHN4: numeric, 84 unique values, 0 missing
TPE11: numeric, 62 unique values, 0 missing
HHN5: numeric, 69 unique values, 0 missing
TPE12: numeric, 58 unique values, 0 missing
HHN6: numeric, 54 unique values, 0 missing
TPE13: numeric, 100 unique values, 0 missing
HHP1: numeric, 390 unique values, 0 missing
TPE2: numeric, 68 unique values, 0 missing
HHP2: numeric, 381 unique values, 0 missing
TPE3: numeric, 73 unique values, 0 missing
HIT: numeric, 77 unique values, 0 missing
TPE4: numeric, 71 unique values, 0 missing
HOMEE: nominal, 2 unique values, 0 missing
TPE5: numeric, 50 unique values, 0 missing
HOMEOWNR: nominal, 3 unique values, 0 missing
TPE6: numeric, 29 unique values, 0 missing
HPHONE_D: nominal, 2 unique values, 0 missing
TPE7: numeric, 18 unique values, 0 missing
HU1: numeric, 100 unique values, 0 missing
TPE8: numeric, 83 unique values, 0 missing
HU2: numeric, 100 unique values, 0 missing
TPE9: numeric, 54 unique values, 0 missing
HU3: numeric, 93 unique values, 0 missing
VC1: numeric, 95 unique values, 0 missing
HU4: numeric, 94 unique values, 0 missing
VC2: numeric, 86 unique values, 0 missing
HU5: numeric, 100 unique values, 0 missing
VC3: numeric, 100 unique values, 0 missing
HUPA1: numeric, 98 unique values, 0 missing
VC4: numeric, 96 unique values, 0 missing
HUPA2: numeric, 100 unique values, 0 missing
VETERANS: nominal, 2 unique values, 0 missing
HUPA3: numeric, 100 unique values, 0 missing
VIETVETS: numeric, 95 unique values, 0 missing
HUPA4: numeric, 96 unique values, 0 missing
VOC1: numeric, 93 unique values, 0 missing
HUPA5: numeric, 82 unique values, 0 missing
VOC2: numeric, 100 unique values, 0 missing
HUPA6: numeric, 100 unique values, 0 missing
VOC3: numeric, 77 unique values, 0 missing
HUPA7: numeric, 48 unique values, 0 missing
WALKER: nominal, 2 unique values, 0 missing
HUR1: numeric, 95 unique values, 0 missing
WEALTH1: nominal, 10 unique values, 38495 missing
HUR2: numeric, 100 unique values, 0 missing
WEALTH2: nominal, 10 unique values, 37578 missing
HV1: numeric, 4338 unique values, 0 missing
WWIIVETS: numeric, 100 unique values, 0 missing
HV2: numeric, 4504 unique values, 0 missing
ZIP: nominal, 18543 unique values, 0 missing
HV3: numeric, 14 unique values, 0 missing
HV4: numeric, 14 unique values, 0 missing
HVP1: numeric, 100 unique values, 0 missing
HVP2: numeric, 100 unique values, 0 missing
HVP3: numeric, 100 unique values, 0 missing
HVP4: numeric, 100 unique values, 0 missing
HVP5: numeric, 100 unique values, 0 missing
HVP6: numeric, 100 unique values, 0 missing
IC1: numeric, 1104 unique values, 0 missing
IC10: numeric, 65 unique values, 0 missing
IC11: numeric, 53 unique values, 0 missing
IC12: numeric, 39 unique values, 0 missing
IC13: numeric, 28 unique values, 0 missing
IC14: numeric, 73 unique values, 0 missing
IC15: numeric, 96 unique values, 0 missing
IC16: numeric, 75 unique values, 0 missing
IC17: numeric, 72 unique values, 0 missing
IC18: numeric, 71 unique values, 0 missing
IC19: numeric, 73 unique values, 0 missing
IC2: numeric, 1191 unique values, 0 missing
IC20: numeric, 58 unique values, 0 missing
IC21: numeric, 47 unique values, 0 missing
IC22: numeric, 34 unique values, 0 missing
IC23: numeric, 76 unique values, 0 missing
IC3: numeric, 1080 unique values, 0 missing
IC4: numeric, 1144 unique values, 0 missing
IC5: numeric, 20761 unique values, 0 missing
IC6: numeric, 100 unique values, 0 missing
IC7: numeric, 66 unique values, 0 missing
IC8: numeric, 61 unique values, 0 missing
IC9: numeric, 64 unique values, 0 missing
INCOME: nominal, 7 unique values, 18515 missing
KIDSTUFF: nominal, 2 unique values, 0 missing
LASTDATE: nominal, 24 unique values, 0 missing
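
The missing-value counts in the listing are easier to compare as fractions of the 82318 instances. The sketch below works through that arithmetic for a handful of features copied from the listing. The 0.5 cutoff and the helper name `missing_rates` are illustrative assumptions, not something the dataset page defines.

```python
N_ROWS = 82318  # number of instances reported for this dataset

# Missing-value counts copied from the feature listing (a small selection).
missing = {
    "TIMELAG": 8339,
    "INCOME": 18515,
    "AGE": 20371,
    "WEALTH1": 38495,
    "NUMCHLD": 71727,
    "RAMNT_5": 82304,
}

def missing_rates(counts, n_rows):
    """Map each feature name to its fraction of missing values."""
    return {name: count / n_rows for name, count in counts.items()}

rates = missing_rates(missing, N_ROWS)

# Hypothetical cutoff: flag features that are missing in most rows.
mostly_missing = sorted(name for name, rate in rates.items() if rate > 0.5)

for name, rate in sorted(rates.items(), key=lambda kv: kv[1]):
    print(f"{name:8s} {rate:6.1%}")
print(mostly_missing)  # ['NUMCHLD', 'RAMNT_5']
```

Features such as RAMNT_5 are missing in nearly every row, so a model pipeline would likely need an explicit imputation or exclusion rule for them.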

19 properties

Number of instances (rows) of the dataset: 82318
Number of attributes (columns) of the dataset: 478
Number of distinct values of the target attribute (if it is nominal): 2
Number of missing values in the dataset: 2399311
Number of instances with at least one value missing: 82318
Number of numeric attributes: 341
Number of nominal attributes: 137
Number of instances belonging to the most frequent class: 72628
Percentage of instances belonging to the least frequent class: 11.77
Number of instances belonging to the least frequent class: 9690
Number of binary attributes: 30
Percentage of binary attributes: 6.28
Percentage of instances having missing values: 100
Average class difference between consecutive instances: 1
Percentage of missing values: 6.1
Number of attributes divided by the number of instances: 0.01
Percentage of numeric attributes: 71.34
Percentage of instances belonging to the most frequent class: 88.23
Percentage of nominal attributes: 28.66
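
Several of these properties are derived from the raw counts, so they can be cross-checked with simple arithmetic. The sketch below recomputes them from the counts reported above. The variable names and the `pct` helper are illustrative, and rounding to two decimals is an assumption about how the page displays values.

```python
# Raw counts as reported in the properties above.
n_rows, n_cols = 82318, 478
n_missing = 2399311
n_numeric, n_nominal, n_binary = 341, 137, 30
n_majority, n_minority = 72628, 9690

def pct(part, whole):
    """Percentage rounded to two decimals (assumed display precision)."""
    return round(100 * part / whole, 2)

derived = {
    "pct_minority_class": pct(n_minority, n_rows),       # reported: 11.77
    "pct_majority_class": pct(n_majority, n_rows),       # reported: 88.23
    "pct_numeric": pct(n_numeric, n_cols),                # reported: 71.34
    "pct_nominal": pct(n_nominal, n_cols),                # reported: 28.66
    "pct_binary": pct(n_binary, n_cols),                  # reported: 6.28
    "pct_missing": pct(n_missing, n_rows * n_cols),       # reported: 6.1
    "attrs_per_instance": round(n_cols / n_rows, 2),      # reported: 0.01
}

# The counts are also internally consistent.
assert n_numeric + n_nominal == n_cols
assert n_majority + n_minority == n_rows

for name, value in derived.items():
    print(name, value)
```

The percentage of missing values is taken over all cells (rows times columns), which is why it is about 6.1% even though every instance has at least one missing value.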

10 tasks

0 runs - estimation_procedure: 10% Holdout set - target_feature: TARGET_B
0 runs - estimation_procedure: 33% Holdout set - target_feature: TARGET_B
0 runs - estimation_procedure: 10-fold Crossvalidation - evaluation_measure: predictive_accuracy - target_feature: TARGET_B
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
0 runs - estimation_procedure: 50 times Clustering
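
For readers unfamiliar with the estimation procedures named in the supervised tasks, the following sketch shows what a holdout split and a 10-fold cross-validation partition look like on toy labels. It is not how the platform generates its splits: the function names, the per-class (stratified) sampling in the holdout, and the fixed seed are all assumptions made for illustration.

```python
import random

def holdout_split(labels, test_fraction, seed=0):
    """Return (train_idx, test_idx), sampling each class separately."""
    rng = random.Random(seed)
    by_class = {}
    for i, y in enumerate(labels):
        by_class.setdefault(y, []).append(i)
    train, test = [], []
    for idx in by_class.values():
        rng.shuffle(idx)
        n_test = round(len(idx) * test_fraction)
        test.extend(idx[:n_test])
        train.extend(idx[n_test:])
    return sorted(train), sorted(test)

def kfold_indices(n, k=10, seed=0):
    """Yield (train_idx, test_idx) for k non-overlapping folds."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    for i in range(k):
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        yield train, folds[i]

# Toy labels with roughly the class balance of TARGET_B (12% positives).
labels = ["1"] * 12 + ["0"] * 88
train, test = holdout_split(labels, test_fraction=0.33)
folds = list(kfold_indices(len(labels), k=10))
print(len(train), len(test), len(folds))  # 67 33 10
```

Sampling each class separately keeps the minority share similar in the training and test parts, which matters with a class balance this skewed.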