OpenML
Measles-Immunization-Rates-in-US-Schools

Measles-Immunization-Rates-in-US-Schools

active ARFF CC BY-SA 4.0 Visibility: public Uploaded 24-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
  • Computer Systems Machine Learning
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context This data set contains measles vaccination rate data for 46,412 schools in 32 states across the US. Content Vaccination rates are for the 2017-201818 school year for the following states: Colorado Connecticut Minnesota Montana New Jersey New York North Dakota Pennsylvania South Dakota Utah Washington Rates for other states are for the time period 2018-2019. The data was compiled by The Wall Street Journal. Acknowledgements The data was originally compiled by The Wall Street Journal, and then downloaded and wrangled by the TidyTuesday community. The R code used for wrangling can be accessed here. Inspiration Please remember that you are welcome to explore beyond the provided data set, but the data is provided as a "toy" data set to practice techniques on. The data may require additional cleaning and wrangling!

15 features

index (ignore)numeric8066 unique values
0 missing
statestring32 unique values
0 missing
yearstring3 unique values
5681 missing
namestring36129 unique values
0 missing
typestring6 unique values
36622 missing
citystring5665 unique values
20071 missing
countystring1157 unique values
6255 missing
districtnumeric0 unique values
66113 missing
enrollnumeric966 unique values
16260 missing
mmrnumeric4796 unique values
0 missing
overallnumeric2691 unique values
0 missing
xrelnominal1 unique values
66004 missing
xmednumeric1126 unique values
45122 missing
xpernumeric1591 unique values
57560 missing
latnumeric44562 unique values
1549 missing
lngnumeric44545 unique values
1549 missing

19 properties

66113
Number of instances (rows) of the dataset.
15
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
322786
Number of missing values in the dataset.
66113
Number of instances with at least one value missing.
8
Number of numeric attributes.
1
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
53.33
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
6.67
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
1
Number of binary attributes.
6.67
Percentage of binary attributes.
100
Percentage of instances having missing values.
Average class difference between consecutive instances.
32.55
Percentage of missing values.

0 tasks

Define a new task