Data
Hospital

Hospital

active ARFF Creative Commons Attribution 4.0 International Visibility: public Uploaded 25-06-2024 by Bruno Belucci Teixeira
0 likes downloaded by 0 people , 0 total downloads 0 issues 0 downvotes
Issue #Downvotes for this reason By


Loading wiki
Help us complete this description Edit
Monthly patient count for products that are related to medical problems. From original source: ----- Monthly patient count for products that are related to medical problems. There are 767 time series that had a mean count of at least 10 and no zeros. ----- Extracted from 'expsmooth' R package (.csv available on official website) There are 6 columns: id_series: The id of the time series. date: The date of the time series in the format "%Y-%m-%d". time_step: The time step on the time series. value_0: The values of the time series, which will be used for the forecasting task. covariate_X (X from 0 to 1): Covariate values of the time series, tied to the 'id_series'. Not interested in forecasting, but can help with the forecasting task. Preprocessing: 1 - Melted the dataset with indentifiers 'MPriceHospLOS2000_SKUCode', 'MPriceHospLOS2000_RootEntityCode', obtaining columns 'date' and 'value'. 2 - Standardize the date to the format %Y-%m-%d. 3 - Renamed columns 'MPriceHospLOS2000_SKUCode', 'MPriceHospLOS2000_RootEntityCode' to 'covariate_0' and 'covariate_1'. 4 - Created column 'id_series' from covariate_0' and 'covariate_1' with index from 0 to 766. 5 - Created column 'time_step' with increasing values of the time_step for the time series. 6 - Casted 'value' columns to int, and defined 'id_series', covariate_0' and 'covariate_1' as 'category'.

6 features

covariate_0nominal71 unique values
0 missing
covariate_1nominal35 unique values
0 missing
datestring84 unique values
0 missing
value_0numeric3209 unique values
0 missing
id_seriesnominal767 unique values
0 missing
time_stepnumeric84 unique values
0 missing

19 properties

64428
Number of instances (rows) of the dataset.
6
Number of attributes (columns) of the dataset.
Number of distinct values of the target attribute (if it is nominal).
0
Number of missing values in the dataset.
0
Number of instances with at least one value missing.
2
Number of numeric attributes.
3
Number of nominal attributes.
0
Percentage of binary attributes.
0
Percentage of instances having missing values.
0
Percentage of missing values.
Average class difference between consecutive instances.
33.33
Percentage of numeric attributes.
0
Number of attributes divided by the number of instances.
50
Percentage of nominal attributes.
Percentage of instances belonging to the most frequent class.
Number of instances belonging to the most frequent class.
Percentage of instances belonging to the least frequent class.
Number of instances belonging to the least frequent class.
0
Number of binary attributes.

0 tasks

Define a new task