Data
Eighty-years-of-Canadian-climate-data

Eighty-years-of-Canadian-climate-data

active ARFF CC0: Public Domain Visibility: public Uploaded 24-03-2022 by Elif Ceren Gok
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
This dataset has been compiled from public sources. The dataset consists of daily temperatures and precipitation from 13 Canadian centres. Precipitation is either rain or snow (likely snow in winter months). In 1940, there is daily data for seven out of the 13 centres, but by 1960 there is daily data from all 13 centres, with the occasional missing value. Few of Canadas weather stations have been operating continuously, so we did need to patch together the data. Our source data is from https://climate-change.canada.ca/climate-data//daily-climate-data and here are the weather stations that we queried: CALGARY INTL A CALGARY INT'L A EDMONTON INTL A EDMONTON INT'L A HALIFAX STANFIELD INT'L A HALIFAX STANFIELD INT'L A MONCTON A MONCTON A MONTREAL/PIERRE ELLIOTT TRUDEAU INTL MONTREAL/PIERRE ELLIOTT TRUDEAU INTL A OTTAWA INTL A OTTAWA MACDONALD-CARTIER INT'L A QUEBEC/JEAN LESAGE INTL QUEBEC/JEAN LESAGE INTL A SASKATOON DIEFENBAKER INT'L A SASKATOON INTL A ST JOHN'S A ST JOHNS WEST CLIMATE TORONTO INTL A TORONTO LESTER B. PEARSON INT'L A VANCOUVER INTL A VANCOUVER INT'L A WHITEHORSE A WHITEHORSE A WINNIPEG RICHARDSON INT'L A WINNIPEG THE FORKS Suggested uses: The data is suitable for time series forecasting. At Penny Analytics, we are using this dataset to demonstrate outlier detection (anomaly detection) in multiple time series and here is the relevant blogpost: https://pennyanalytics.com/2020/01/28/climate-change-canadian-weather-anomalies-are-now-warmer-and-wetter/

27 features

LOCAL_DATEstring29221 unique values
0 missing
MEAN_TEMPERATURE_CALGARYnumeric576 unique values
189 missing
TOTAL_PRECIPITATION_CALGARYnumeric318 unique values
173 missing
MEAN_TEMPERATURE_EDMONTONnumeric603 unique values
7657 missing
TOTAL_PRECIPITATION_EDMONTONnumeric297 unique values
7646 missing
MEAN_TEMPERATURE_HALIFAXnumeric472 unique values
7164 missing
TOTAL_PRECIPITATION_HALIFAXnumeric586 unique values
7226 missing
MEAN_TEMPERATURE_MONCTONnumeric510 unique values
2336 missing
TOTAL_PRECIPITATION_MONCTONnumeric484 unique values
2767 missing
MEAN_TEMPERATURE_MONTREALnumeric1050 unique values
755 missing
TOTAL_PRECIPITATION_MONTREALnumeric651 unique values
730 missing
MEAN_TEMPERATURE_OTTAWAnumeric569 unique values
76 missing
TOTAL_PRECIPITATION_OTTAWAnumeric401 unique values
81 missing
MEAN_TEMPERATURE_QUEBECnumeric595 unique values
1214 missing
TOTAL_PRECIPITATION_QUEBECnumeric683 unique values
1227 missing
MEAN_TEMPERATURE_SASKATOONnumeric672 unique values
2473 missing
TOTAL_PRECIPITATION_SASKATOONnumeric282 unique values
3755 missing
MEAN_TEMPERATURE_STJOHNSnumeric637 unique values
808 missing
TOTAL_PRECIPITATION_STJOHNSnumeric654 unique values
822 missing
MEAN_TEMPERATURE_TORONTOnumeric523 unique values
74 missing
TOTAL_PRECIPITATION_TORONTOnumeric385 unique values
80 missing
MEAN_TEMPERATURE_VANCOUVERnumeric354 unique values
53 missing
TOTAL_PRECIPITATION_VANCOUVERnumeric409 unique values
55 missing
MEAN_TEMPERATURE_WHITEHORSEnumeric667 unique values
1691 missing
TOTAL_PRECIPITATION_WHITEHORSEnumeric181 unique values
4095 missing
MEAN_TEMPERATURE_WINNIPEGnumeric1287 unique values
124 missing
TOTAL_PRECIPITATION_WINNIPEGnumeric528 unique values
247 missing

19 properties

29221
Number of instances (rows) of the dataset.
27
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
53518
Number of missing values in the dataset.
12445
Number of instances with at least one value missing.
26
Number of numeric attributes.
0
Number of nominal attributes.
0
Number of attributes divided by the number of instances.
96.3
Percentage of numeric attributes.
Percentage of instances belonging to the most frequent class.
0
Percentage of nominal attributes.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
42.59
Percentage of instances having missing values.
Average class difference between consecutive instances.
6.78
Percentage of missing values.

0 tasks

Define a new task