Data
space_ga

space_ga

active ARFF Publicly available Visibility: public Uploaded 26-05-2023 by Sebastian Fischer
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Data Description The dataset contains 3,107 observations on U.S. county votes cast in the 1980 presidential election. Given population, education levels in the population, number of owned houses, incomes and coordinates, the goal is to predict the number of votes per population (logarithmic). Attribute Description 1. *ln_votes_pop* - logarithm of total number of votes - logarithm of populaiton (ln(votes/pop)), target feature 2. *pop* - the population in each county of 18 years of age or older 3. *education* - the population in each county with a 12th grade or higher education 4. *houses* - the number of owner-occupied housing units 5. *income* - the aggregate income 6. *xcoord* - X spatial coordinate of the county 7. *ycoord* - Y spatial coordinate of the county

7 features

ln_votes_pop (target)numeric3105 unique values
0 missing
popnumeric3001 unique values
0 missing
educationnumeric2914 unique values
0 missing
housesnumeric2832 unique values
0 missing
incomenumeric3097 unique values
0 missing
xcoordnumeric3107 unique values
0 missing
ycoordnumeric3106 unique values
0 missing

19 properties

3107
Number of instances (rows) of the dataset.
7
Number of attributes (columns) of the dataset.
0
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
7
Number of numeric attributes.
0
Number of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
0.86
Average class difference between consecutive instances.
0
Percentage of missing values.
0
Number of attributes divided by the number of instances.
100
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.

1 tasks

0 runs - estimation_procedure: 10-fold Crossvalidation - target_feature: ln_votes_pop
Define a new task