Data
OpenML
Help
Sign in
×
Sign in
No account? Join OpenML
Forgot password
×
JavaScript is required to properly view the contents of this page!
OpenML
Explore
Data
Task
Flow
Run
Study
Task type
Measure
People
Help
Blog
Contact
Please cite us
simulated_adult
ARFF
CSV
JSON
XML
RDF
simulated_adult
active
ARFF
CC BY 4.0
Visibility: public
Uploaded 10-11-2023 by
Sebastian Fischer
0 likes
downloaded by 0 people , 0 total downloads
0 issues
0 downvotes
study_440
study_441
Add tag
Issue
#Downvotes for this reason
By
Loading wiki
Help us complete this description
Edit
See [https://github.com/slds-lmu/paper_2023_ci_for_ge](https://github.com/slds-lmu/paper_2023_ci_for_ge) for a description.
15 features
class
(target)
nominal
2 unique values
0 missing
age
numeric
76 unique values
0 missing
workclass
nominal
7 unique values
0 missing
fnlwgt
numeric
34776 unique values
0 missing
education
nominal
16 unique values
0 missing
education-num
numeric
19 unique values
0 missing
marital-status
nominal
7 unique values
0 missing
occupation
nominal
14 unique values
0 missing
relationship
nominal
6 unique values
0 missing
race
nominal
5 unique values
0 missing
sex
nominal
2 unique values
0 missing
capital-gain
numeric
121 unique values
0 missing
capital-loss
numeric
91 unique values
0 missing
hours-per-week
numeric
91 unique values
0 missing
native-country
nominal
41 unique values
0 missing
Show all 15 features
19 properties
NumberOfInstances
5100000
Number of instances (rows) of the dataset.
NumberOfFeatures
15
Number of attributes (columns) of the dataset.
NumberOfClasses
2
Number of distinct values of the target attribute (if it is nominal).
NumberOfMissingValues
0
Number of missing values in the dataset.
NumberOfInstancesWithMissingValues
0
Number of instances with at least one value missing.
NumberOfNumericFeatures
6
Number of numeric attributes.
NumberOfSymbolicFeatures
9
Number of nominal attributes.
PercentageOfBinaryFeatures
13.33
Percentage of binary attributes.
PercentageOfInstancesWithMissingValues
0
Percentage of instances having missing values.
AutoCorrelation
0.63
Average class difference between consecutive instances.
PercentageOfMissingValues
0
Percentage of missing values.
Dimensionality
0
Number of attributes divided by the number of instances.
PercentageOfNumericFeatures
40
Percentage of numeric attributes.
MajorityClassPercentage
75.14
Percentage of instances belonging to the most frequent class.
PercentageOfSymbolicFeatures
60
Percentage of nominal attributes.
MajorityClassSize
3832034
Number of instances belonging to the most frequent class.
MinorityClassPercentage
24.86
Percentage of instances belonging to the least frequent class.
MinorityClassSize
1267966
Number of instances belonging to the least frequent class.
NumberOfBinaryFeatures
2
Number of binary attributes.
Show all 19 properties
1 tasks
Supervised Classification on simulated_adult
0 runs
- estimation_procedure: 10-fold Crossvalidation - target_feature: class
Define a new task