Data
Crowdedness-at-the-Campus-Gym

Crowdedness-at-the-Campus-Gym

active ARFF Database: Open Database, Contents: Database Contents Visibility: public Uploaded 24-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Background When is my university campus gym least crowded, so I know when to work out? We measured how many people were in this gym once every 10 minutes over the last year. We want to be able to predict how crowded the gym will be in the future. Goals Given a time of day (and maybe some other features, including weather), predict how crowded the gym will be. Figure out which features are actually important, which are redundant, and what features could be added to make the predictions more accurate. Data The dataset consists of 26,000 people counts (about every 10 minutes) over the last year. In addition, I gathered extra info including weather and semester-specific information that might affect how crowded it is. The label is the number of people, which I'd like to predict given some subset of the features. Label: Number of people Features: date (string; datetime of data) timestamp (int; number of seconds since beginning of day) dayofweek (int; 0 [monday] - 6 [sunday]) is_weekend (int; 0 or 1) [boolean, if 1, it's either saturday or sunday, otherwise 0] is_holiday (int; 0 or 1) [boolean, if 1 it's a federal holiday, 0 otherwise] temperature (float; degrees fahrenheit) isstartof_semester (int; 0 or 1) [boolean, if 1 it's the beginning of a school semester, 0 otherwise] month (int; 1 [jan] - 12 [dec]) hour (int; 0 - 23) Acknowledgements This data was collected with the consent of the university and the gym in question.

11 features

number_peoplenumeric128 unique values
0 missing
datestring62184 unique values
0 missing
timestampnumeric31321 unique values
0 missing
day_of_weeknumeric7 unique values
0 missing
is_weekendnumeric2 unique values
0 missing
is_holidaynumeric2 unique values
0 missing
temperaturenumeric2599 unique values
0 missing
is_start_of_semesternumeric2 unique values
0 missing
is_during_semesternumeric2 unique values
0 missing
monthnumeric12 unique values
0 missing
hournumeric24 unique values
0 missing

19 properties

62184
Number of instances (rows) of the dataset.
11
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
10
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
90.91
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
Average class difference between consecutive instances.
0
Percentage of missing values.

0 tasks

Define a new task