Hayes-Roth Database This is a merged version of the separate train and test set which are usually distributed. On OpenML this train-test split can be found as one of the possible tasks. Source…
0 runs0 likes0 downloads0 reach0 impact
160 instances - 5 features - 3 classes - 0 missing values
Citation Request: This primary tumor domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data.…
0 runs0 likes0 downloads0 reach0 impact
339 instances - 18 features - 21 classes - 225 missing values
1. Title: Glass Identification Database 2. Sources: (a) Creator: B. German -- Central Research Establishment Home Office Forensic Science Service Aldermaston, Reading, Berkshire RG7 4PN (b) Donor:…
0 runs0 likes0 downloads0 reach0 impact
214 instances - 10 features - 6 classes - 0 missing values
The aim is to determine the type of arrhythmia from the ECG recordings. This database contains 279 attributes, 206 of which are linear valued and the rest are nominal. Concerning the study of H. Altay…
0 runs0 likes0 downloads0 reach0 impact
452 instances - 280 features - 13 classes - 408 missing values
The original Annealing dataset from UCI. The exact meaning of the features and classes is largely unknown. Annealing, in metallurgy and materials science, is a heat treatment that alters the physical…
0 runs0 likes0 downloads0 reach0 impact
898 instances - 39 features - 5 classes - 22175 missing values
From original source: ----- Author: Jeffrey C. Schlimmer (Jeffrey.Schlimmer@a.gp.cs.cmu.edu) Source: UCI - 1987 Please cite: 1985 Auto Imports Database This data set consists of three types of…
0 runs0 likes0 downloads0 reach0 impact
205 instances - 26 features - 6 classes - 59 missing values
From original source: ----- The dataset contains count of public bicycles rented per hour in the Seoul Bike Sharing System, with corresponding weather data and holiday information Additional…
0 runs0 likes0 downloads0 reach0 impact
8760 instances - 18 features - 0 classes - 0 missing values
From original source: ----- Description Data Description The aim of this dataset is to predict the burned area of forest fires, in the northeast region of Portugal, by using meteorological and other…
0 runs0 likes0 downloads0 reach0 impact
517 instances - 13 features - 0 classes - 0 missing values
From original source: ----- The dataset consists of 384 features extracted from CT images. The class variable is numeric and denotes the relative location of the CT slice on the axial axis of the…
0 runs0 likes0 downloads0 reach0 impact
53500 instances - 385 features - 0 classes - 0 missing values
From original source: ----- The local stability analysis of the 4-node star system (electricity producer is in the center) implementing Decentral Smart Grid Control concept. Additional Information The…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 13 features - 0 classes - 0 missing values
From original source: ----- The local stability analysis of the 4-node star system (electricity producer is in the center) implementing Decentral Smart Grid Control concept. Additional Information The…
0 runs0 likes0 downloads0 reach0 impact
10000 instances - 13 features - 2 classes - 0 missing values
From original source: ----- The dataset contains count of public bicycles rented per hour in the Seoul Bike Sharing System, with corresponding weather data and holiday information Additional…
0 runs0 likes0 downloads0 reach0 impact
8760 instances - 14 features - 0 classes - 0 missing values
From original source: ----- Data set containing values for 8 attributes (molecular descriptors) of 546 chemicals used to predict quantitative acute aquatic toxicity towards Daphnia Magna.. Additional…
0 runs0 likes0 downloads0 reach0 impact
546 instances - 9 features - 0 classes - 0 missing values
From original source: ----- The UJIIndoorLoc is a Multi-Building Multi-Floor indoor localization database to test Indoor Positioning System that rely on WLAN/WiFi fingerprint. Additional Information…
0 runs0 likes0 downloads0 reach0 impact
21048 instances - 526 features - 0 classes - 0 missing values
From original source: ----- The UJIIndoorLoc is a Multi-Building Multi-Floor indoor localization database to test Indoor Positioning System that rely on WLAN/WiFi fingerprint. Additional Information…
0 runs0 likes0 downloads0 reach0 impact
21048 instances - 526 features - 0 classes - 0 missing values
From original source: ----- Data was from a simulation of a servo system Additional Information Ross Quinlan: This data was given to me by Karl Ulrich at MIT in 1986. I didn't record his description…
0 runs0 likes0 downloads0 reach0 impact
167 instances - 5 features - 0 classes - 0 missing values
From original source: ----- Context Machine Learning with R by Brett Lantz is a book that provides an introduction to machine learning using R. As far as I can tell, Packt Publishing does not make its…
0 runs0 likes0 downloads0 reach0 impact
1338 instances - 7 features - 0 classes - 0 missing values
From original source: ----- KEGG Metabolic pathways modeled as directed relation network. Variety of graphical features presented. Additional Information KEGG Metabolic pathways can be realized into…
0 runs0 likes0 downloads0 reach0 impact
53413 instances - 23 features - 0 classes - 0 missing values
From original source: ----- Communities within the United States. The data combines socio-economic data from the 1990 US Census, law enforcement data from the 1990 US LEMAS survey, and crime data from…
0 runs0 likes0 downloads0 reach0 impact
209 instances - 9 features - 0 classes - 0 missing values
From original source: ----- Communities within the United States. The data combines socio-economic data from the 1990 US Census, law enforcement data from the 1990 US LEMAS survey, and crime data from…
0 runs0 likes0 downloads0 reach0 impact
1994 instances - 123 features - 0 classes - 36851 missing values
From original source: ----- This hourly data set contains the PM2.5 data of US Embassy in Beijing. Meanwhile, meteorological data from Beijing Capital International Airport are also included.…
0 runs0 likes0 downloads0 reach0 impact
41757 instances - 12 features - 0 classes - 0 missing values
From original source: ----- About this file What are the things that a potential home buyer considers before purchasing a house? The location, the size of the property, vicinity to offices, schools,…
0 runs0 likes0 downloads0 reach0 impact
13320 instances - 8 features - 0 classes - 682 missing values
From original source: ----- Additional Information The data set is at 10 min for about 4.5 months. The house temperature and humidity conditions were monitored with a ZigBee wireless sensor network.…
0 runs0 likes0 downloads0 reach0 impact
19735 instances - 28 features - 0 classes - 0 missing values
From original source: ----- As part of our continued commitment towards the WWW community and its success, we are again making available the entire data sets for this set of surveys. This enables…
0 runs0 likes0 downloads0 reach0 impact
10108 instances - 69 features - 2 classes - 309 missing values
From original source: ----- Additional Information Information about customers consists of 86 variables and includes product usage data and socio-demographic data derived from zip area codes. The data…
0 runs0 likes0 downloads0 reach0 impact
9822 instances - 86 features - 2 classes - 0 missing values
From original source: ----- Context "Predict behavior to retain customers. You can analyze all relevant customer data and develop focused customer retention programs." [IBM Sample Data Sets] Content…
0 runs0 likes0 downloads0 reach0 impact
7043 instances - 20 features - 2 classes - 11 missing values
From original source: ----- Additional Information The dataset includes 244 instances that regroup a data of two regions of Algeria,namely the Bejaia region located in the northeast of Algeria and the…
0 runs0 likes0 downloads0 reach0 impact
243 instances - 14 features - 2 classes - 0 missing values
daily pickup data for 329 FHV companies from January 2015 through August 2015. From original source: ----- There is also a file other-FHV-data-jan-aug-2015.csv containing daily pickup data for 329 FHV…
0 runs0 likes0 downloads0 reach0 impact
29826 instances - 5 features - classes - 9276 missing values
Quarterly Database for Macroeconomic Research From original website: ----- FRED-MD and FRED-QD are large macroeconomic databases designed for the empirical analysis of 'big data'. The datasets of…
0 runs0 likes0 downloads0 reach0 impact
258 instances - 204 features - classes - 0 missing values
Electricity Load Diagrams between 2011 and 2014, resampled hourly. From original source: ----- Data set has no missing values. Values are in kW of each 15 min. To convert values in kWh values must be…
0 runs0 likes0 downloads0 reach0 impact
26305 instances - 319 features - classes - 0 missing values
Wind power production in MW recorded per every 4 seconds starting from 01/08/2019 in Australia. From the website: ----- This dataset contains a single very long daily time series representing the wind…
0 runs0 likes0 downloads0 reach0 impact
7397147 instances - 4 features - classes - 0 missing values
Solar power production in MW recorded per every 4 seconds starting from 01/08/2019 in Australia. From the website: ----- This dataset contains a single very long daily time series representing the…
0 runs0 likes0 downloads0 reach0 impact
7397222 instances - 4 features - classes - 0 missing values
Number of births in the United States. From original source: ----- Number of births in the United States. There are several data sets covering different date ranges and obtaining data from different…
0 runs0 likes0 downloads0 reach0 impact
7305 instances - 4 features - classes - 0 missing values
Mean daily flow in cubic meters per second (cumecs) of the Saugeen River. From original source: ----- Mean daily flow in cubic meters per second (cumecs) of the Saugeen River at Walkerton, Jan 1, 1915…
0 runs0 likes0 downloads0 reach0 impact
23741 instances - 4 features - classes - 0 missing values
Daily total sunspot number from 1818 to 2023. From original source: ----- Time range: 1/1/1818 - last elapsed month (provisional values) Data description: Daily total sunspot number derived by the…
0 runs0 likes0 downloads0 reach0 impact
75233 instances - 6 features - classes - 6480 missing values
Daily values of confirmed cases, deaths and recovers for COVID-19 in US. From original source: ----- MThis folder contains daily time series summary tables, including confirmed, deaths and recovered.…
0 runs0 likes0 downloads0 reach0 impact
3819906 instances - 15 features - classes - 0 missing values
Daily values of confirmed cases, deaths and recovers for COVID-19 in several countries. From original source: ----- MThis folder contains daily time series summary tables, including confirmed, deaths…
0 runs0 likes0 downloads0 reach0 impact
313182 instances - 10 features - classes - 0 missing values
Monthly patient count for products that are related to medical problems. From original source: ----- Monthly patient count for products that are related to medical problems. There are 767 time series…
0 runs0 likes0 downloads0 reach0 impact
64428 instances - 6 features - classes - 0 missing values
Uber, Lyft and weather hourly data. From original website: ----- Context Uber and Lyft's ride prices are not constant like public transport. They are greatly affected by the demand and supply of rides…
0 runs0 likes0 downloads0 reach0 impact
77904 instances - 21 features - classes - 0 missing values
Monthly sales car parts. 2674 series. Jan 1998 - Mar 2002. Extracted from 'expsmooth' R package. There are 2677 columns: id_series: The id of the time series. date: The date of the time series in the…
0 runs0 likes0 downloads0 reach0 impact
51 instances - 2677 features - classes - 0 missing values
Pedestrian Counting System published by the city of Melbourne, preprocessed data. From original source: ----- This dataset contains hourly pedestrian counts since 2009 from pedestrian sensor devices…
0 runs0 likes0 downloads0 reach0 impact
43824 instances - 54 features - classes - 151586 missing values
Bitcoin data scrapped from BitInfoCharts, without 'tweets' and with preprocessing. Several Bitcoin related data scrapped directly from BitInfoCharts. 'date' in the format %Y-%m-%d. The 'tweets' column…
0 runs0 likes0 downloads0 reach0 impact
4792 instances - 21 features - classes - 0 missing values
Bitcoin data scrapped from BitInfoCharts, with preprocessing. Several Bitcoin related data scrapped directly from BitInfoCharts. 'date' in the format %Y-%m-%d. We have only kept the rows between the…
0 runs0 likes0 downloads0 reach0 impact
3262 instances - 22 features - classes - 0 missing values
Dominick dataset forecasting data, weekly data. From original source: ----- This dataset contains 115704 weekly time series representing the profit of individual stock keeping units from a retailer.…
0 runs0 likes0 downloads0 reach0 impact
19092987 instances - 3 features - classes - 0 missing values
Australian Electricity Demand forecasting data, half-hourly data. From original source: ----- This dataset contains 5 time series representing the half hourly electricity demand of 5 states in…
0 runs0 likes0 downloads0 reach0 impact
1155264 instances - 5 features - classes - 0 missing values
CIF 2016 time series forecasting competition , monthly data. From original source: ----- Competition Data Format Data file containing time series to be predicted is a text file having the following…
0 runs0 likes0 downloads0 reach0 impact
7108 instances - 3 features - classes - 0 missing values
Tourism competion for time series forecasting, monthly data From original source: ----- The data we use include 366 monthly series, 427 quarterly series and 518 yearly series. They were supplied by…
0 runs0 likes0 downloads0 reach0 impact
109280 instances - 4 features - classes - 0 missing values
Tourism competion for time series forecasting, quarterly data. From original source: ----- The data we use include 366 monthly series, 427 quarterly series and 518 yearly series. They were supplied by…
0 runs0 likes0 downloads0 reach0 impact
42544 instances - 4 features - classes - 0 missing values
Tourism competion for time series forecasting, yearly data. From original source: ----- The data we use include 366 monthly series, 427 quarterly series and 518 yearly series. They were supplied by…
0 runs0 likes0 downloads0 reach0 impact
12678 instances - 4 features - classes - 0 missing values
M4-Competition for time series forecasting, hourly data. From original source: ----- The fourth competition, M4, started on 1 January 2018 and ended in 31 May 2018. The M4 extended and replicated the…
0 runs0 likes0 downloads0 reach0 impact
373372 instances - 5 features - classes - 0 missing values
M4-Competition for time series forecasting, daily data. From original source: ----- The fourth competition, M4, started on 1 January 2018 and ended in 31 May 2018. The M4 extended and replicated the…
0 runs0 likes0 downloads0 reach0 impact
10023836 instances - 5 features - classes - 0 missing values
M4-Competition for time series forecasting, weekly data. From original source: ----- The fourth competition, M4, started on 1 January 2018 and ended in 31 May 2018. The M4 extended and replicated the…
0 runs0 likes0 downloads0 reach0 impact
371579 instances - 5 features - classes - 0 missing values
M4-Competition for time series forecasting, monthly data. From original source: ----- The fourth competition, M4, started on 1 January 2018 and ended in 31 May 2018. The M4 extended and replicated the…
0 runs0 likes0 downloads0 reach0 impact
11246411 instances - 5 features - classes - 0 missing values
M4-Competition for time series forecasting, quarterly data. From original source: ----- The fourth competition, M4, started on 1 January 2018 and ended in 31 May 2018. The M4 extended and replicated…
0 runs0 likes0 downloads0 reach0 impact
2406108 instances - 5 features - classes - 0 missing values
M4-Competition for time series forecasting, yearly data From original source: ----- The fourth competition, M4, started on 1 January 2018 and ended in 31 May 2018. The M4 extended and replicated the…
0 runs0 likes0 downloads0 reach0 impact
858458 instances - 5 features - classes - 0 missing values
M3-Competition for time series forecasting, other data From original source: ----- The 3003 series of the M3-Competition were selected on a quota basis to include various types of time series data…
0 runs0 likes0 downloads0 reach0 impact
13325 instances - 4 features - classes - 0 missing values
M3-Competition for time series forecasting, monthly data. From original source: ----- The 3003 series of the M3-Competition were selected on a quota basis to include various types of time series data…
0 runs0 likes0 downloads0 reach0 impact
167562 instances - 5 features - classes - 0 missing values
M3-Competition for time series forecasting, quarterly data From original source: ----- The 3003 series of the M3-Competition were selected on a quota basis to include various types of time series data…
0 runs0 likes0 downloads0 reach0 impact
37004 instances - 5 features - classes - 0 missing values
M3-Competition for time series forecasting, yearly data. From original source: ----- The 3003 series of the M3-Competition were selected on a quota basis to include various types of time series data…
0 runs0 likes0 downloads0 reach0 impact
18319 instances - 5 features - classes - 0 missing values
Weekly U.S. Product Supplied of Finished Motor Gasoline (Thousand Barrels per Day). Provided by U.S. Energy Information AdministratiQon. Downloaded from the website as available on 24-05-2024. There…
0 runs0 likes0 downloads0 reach0 impact
1737 instances - 4 features - classes - 0 missing values
Weather measures from Versuchsbeete provided by the Max-Planck-Institute for Biogeochemistry Several weather measures provided by Max-Planck-Institute for Biogeochemistry from the Weather Station on…
0 runs0 likes0 downloads0 reach0 impact
916772 instances - 36 features - classes - 730081 missing values
Weather measures from Saaleaue provided by the Max-Planck-Institute for Biogeochemistry. Several weather measures provided by Max-Planck-Institute for Biogeochemistry from the Weather Station on Top…
0 runs0 likes0 downloads0 reach0 impact
1151865 instances - 33 features - classes - 391940 missing values
Weather measures from Beutenberg provided by the Max-Planck-Institute for Biogeochemistry Several weather measures provided by Max-Planck-Institute for Biogeochemistry from the Weather Station on Top…
0 runs0 likes0 downloads0 reach0 impact
1078742 instances - 24 features - classes - 243919 missing values
Monthly Database for Macroeconomic Research From original website: ----- FRED-MD and FRED-QD are large macroeconomic databases designed for the empirical analysis of 'big data'. The datasets of…
0 runs0 likes0 downloads0 reach0 impact
778 instances - 119 features - classes - 0 missing values
Hourly temperature and rainfall observation from the Bureau of Metereology of the Australian Government. From original source: ----- Historical rainfall and temperature forecast and observations…
0 runs0 likes0 downloads0 reach0 impact
8058447 instances - 8 features - classes - 492318 missing values
Electricity Load Diagrams between 2011 and 2014. From original source: ----- Data set has no missing values. Values are in kW of each 15 min. To convert values in kWh values must be divided by 4. Each…
0 runs0 likes0 downloads0 reach0 impact
105217 instances - 319 features - classes - 0 missing values
Outpatient Illness Surveillance weekly data. From original source: ----- Outpatient Illness Surveillance - Information on patient visits to health care providers for influenza-like illness is…
0 runs0 likes0 downloads0 reach0 impact
1099 instances - 12 features - classes - 0 missing values
Electric power distribution, 15 minutely data. From original source: ----- The electric power distribution problem is the distribution of electricity to different areas depends on its sequential…
0 runs0 likes0 downloads0 reach0 impact
69680 instances - 10 features - classes - 0 missing values
Electric power distribution, 15 minutely data. From original source: ----- The electric power distribution problem is the distribution of electricity to different areas depends on its sequential…
0 runs0 likes0 downloads0 reach0 impact
69680 instances - 10 features - classes - 0 missing values
Electric power distribution, hourly data. From original source: ----- The electric power distribution problem is the distribution of electricity to different areas depends on its sequential usage. But…
0 runs0 likes0 downloads0 reach0 impact
17420 instances - 10 features - classes - 0 missing values
Electric power distribution, hourly data. From original source: ----- The electric power distribution problem is the distribution of electricity to different areas depends on its sequential usage. But…
0 runs0 likes0 downloads0 reach0 impact
17420 instances - 10 features - classes - 0 missing values
Pedestrian Counting System published by the city of Melbourne, data with some minor preprocessing. From original website: ----- This dataset contains hourly pedestrian counts since 2009 from…
0 runs0 likes0 downloads0 reach0 impact
132209 instances - 110 features - classes - 8604673 missing values
Bitcoin data scrapped from BitInfoCharts, with some minor preprocessing. Several Bitcoin related data scrapped directly from BitInfoCharts. 'date' in the format %Y-%m-%d. We have only keep the rows…
0 runs0 likes0 downloads0 reach0 impact
3262 instances - 20 features - classes - 85 missing values
Bitcoin data scrapped from BitInfoCharts, without the 'tweets' values. Several Bitcoin related data scrapped directly from BitInfoCharts. 'date' in the format %Y-%m-%d. The 'tweets' column was dropped…
0 runs0 likes0 downloads0 reach0 impact
4792 instances - 19 features - classes - 402 missing values
Bitcoin data scrapped from BitInfoCharts. Several Bitcoin related data scrapped directly from BitInfoCharts. Raw data. 'date' in the format %Y-%m-%d
0 runs0 likes0 downloads0 reach0 impact
5629 instances - 20 features - classes - 9163 missing values
Wind Farm power production of 339 wind farms in Australia, minutely data. From the website: ----- This dataset contains very long minutely time series representing the wind power production of 339…
0 runs0 likes0 downloads0 reach0 impact
LondonSmartMeter forecasting data From the website: ----- Energy consumption readings for a sample of 5,567 London Households that took part in the UK Power Networks led Low Carbon London project…
0 runs0 likes0 downloads0 reach0 impact
LondonSmartMeter forecasting data From the website: ----- Energy consumption readings for a sample of 5,567 London Households that took part in the UK Power Networks led Low Carbon London project…
0 runs0 likes0 downloads0 reach0 impact
LondonSmartMeter forecasting data From the website: ----- Energy consumption readings for a sample of 5,567 London Households that took part in the UK Power Networks led Low Carbon London project…
0 runs0 likes0 downloads0 reach0 impact
Weather measures from Beutenberg provided by the Max-Planck-Institute for Biogeochemistry Several weather measures provided by Max-Planck-Institute for Biogeochemistry from the Weather Station on Top…
0 runs0 likes0 downloads0 reach0 impact