Data
NYC-Uber-Pickups-with-Weather-and-Holidays

NYC-Uber-Pickups-with-Weather-and-Holidays

active ARFF CC0: Public Domain Visibility: public Uploaded 24-03-2022 by Dustin Carrion
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Context This is a forked subset of the Uber Pickups in New York City, enriched with weather, borough, and holidays information. Content I created this dataset for a personal project of exploring and predicting pickups in the area by merging data that intuitively seem possible factors for the analysis. These are: Uber Pickups in New York City, from 01/01/2015 to 30/06/2015 (uber-raw-data-janjune-15.csv). (by FiveThirtyEight via kaggle.com) Weather data from National Centers for Environmental Information. LocationID to Borough mapping. (by FiveThirtyEight) NYC public holidays. The main dataset contained over 10 million observations of 4 variables which aggregated per hour and borough, and then joined with the rest of the datasets producing 29,101 observations across 13 variables. These are: pickup_dt: Time period of the observations. borough: NYC's borough. pickups: Number of pickups for the period. spd: Wind speed in miles/hour. vsb: Visibility in Miles to nearest tenth. temp: temperature in Fahrenheit. dewp: Dew point in Fahrenheit. slp: Sea level pressure. pcp01: 1-hour liquid precipitation. pcp06: 6-hour liquid precipitation. pcp24: 24-hour liquid precipitation. sd: Snow depth in inches. hday: Being a holiday (Y) or not (N). Acknowledgements / Original datasets: Uber Pickups in New York City, from 01/01/2015 to 30/06/2015 (uber-raw-data-janjune-15.csv). (by FiveThirtyEight via kaggle.com) Weather data from National Centers for Environmental Information. LocationID to Borough mapping. (by FiveThirtyEight) (Picture credits: Buck Ennis)

13 features

pickup_dtstring4343 unique values
0 missing
boroughstring6 unique values
3043 missing
pickupsnumeric3406 unique values
0 missing
spdnumeric114 unique values
0 missing
vsbnumeric179 unique values
0 missing
tempnumeric295 unique values
0 missing
dewpnumeric305 unique values
0 missing
slpnumeric413 unique values
0 missing
pcp01numeric80 unique values
0 missing
pcp06numeric318 unique values
0 missing
pcp24numeric484 unique values
0 missing
sdnumeric421 unique values
0 missing
hdaystring2 unique values
0 missing

19 properties

29101
Number of instances (rows) of the dataset.
13
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
3043
Number of missing values in the dataset.
3043
Number of instances with at least one value missing.
10
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
76.92
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
10.46
Percentage of instances having missing values.
Average class difference between consecutive instances.
0.8
Percentage of missing values.

0 tasks

Define a new task