Data
US-Births

US-Births

active ARFF Creative Commons Attribution 4.0 International Visibility: public Uploaded 25-06-2024 by Bruno Belucci Teixeira
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Number of births in the United States. From original source: ----- Number of births in the United States. There are several data sets covering different date ranges and obtaining data from different sources. ----- This dataset correspond to the births betweehn 1968-1988. There are 4 columns: id_series: The id of the time series. date: The date of the time series in the format "%Y-%m-%d". time_step: The time step on the time series. value_0: The values of the time series, which will be used for the forecasting task. Preprocessing: 1 - Created 'date' column with columns 'year', 'month', 'day', in the format %Y-%m-%d. 2 - Dropped columns 'year', 'month', 'day', 'day_of_year', 'day_of_week'. 3 - Created the column 'id_series' with value 0, there is only one long time series. 4 - Ensured that there are no missing dates and that the frequency of the time_series is daily. 5 - Created column 'time_step' with increasing values of time step for the time series. 6 - Renamed column 'births' to 'value_0'. 6 - Casted column 'value_0' to int. Defined 'id_series' as 'category'.

4 features

id_seriesnominal1 unique values
0 missing
datestring7305 unique values
0 missing
value_0numeric3538 unique values
0 missing
time_stepnumeric7305 unique values
0 missing

19 properties

7305
Number of instances (rows) of the dataset.
4
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
2
Number of numeric attributes.
1
Number of nominal attributes.
50
Percentage of numeric attributes.
0
Number of attributes divided by the number of instances.
25
Percentage of nominal attributes.
Percentage of instances belonging to the most frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
0
Percentage of missing values.
Average class difference between consecutive instances.

0 tasks

Define a new task