Data
adult

adult

active ARFF public Visibility: public Uploaded 31-05-2022 by Mine Gazioglu
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computational Universe Life Science
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Predict whether income exceeds $50K/yr based on census data. Also known as Census Income dataset. Train and test sets combined. Null values represented with question mark is replaced with na. 52 duplicate values found and dropped

15 features

class (target)nominal2 unique values
0 missing
agenumeric74 unique values
0 missing
workclassnominal8 unique values
2795 missing
fnlwgtnumeric28523 unique values
0 missing
educationnominal16 unique values
0 missing
education_numnumeric16 unique values
0 missing
marital_statusnominal7 unique values
0 missing
occupationnominal14 unique values
2805 missing
relationshipnominal6 unique values
0 missing
racenominal5 unique values
0 missing
sexnominal2 unique values
0 missing
capital_gainnumeric123 unique values
0 missing
capital_lossnumeric99 unique values
0 missing
hours_per_weeknumeric96 unique values
0 missing
native_countrynominal41 unique values
856 missing

19 properties

48790
Number of instances (rows) of the dataset.
15
Number of attributes (columns) of the dataset.
2
Number of distinct values of the target attribute (if it is nominal).
6456
Number of missing values in the dataset.
3615
Number of instances with at least one value missing.
6
Number of numeric attributes.
9
Number of nominal attributes.
23.94
Percentage of instances belonging to the least frequent class.
11681
Number of instances belonging to the least frequent class.
2
Number of binary attributes.
13.33
Percentage of binary attributes.
7.41
Percentage of instances having missing values.
0.63
Average class difference between consecutive instances.
0.88
Percentage of missing values.
0
Number of attributes divided by the number of instances.
40
Percentage of numeric attributes.
60
Percentage of nominal attributes.
76.06
Percentage of instances belonging to the most frequent class.
37109
Number of instances belonging to the most frequent class.

0 tasks

Define a new task