OpenML

Study

JavaScript is required to properly view the contents of this page!

Explore
- Data
- Task
- Flow
- Run
- Study
- Task type
- Measure
- People
Help
Blog
Contact
Please cite us

14 results

Having a Blast: Meta-Learning and Heterogeneous Ensembles for Data Streams

Ensembles of classifiers are among the best performing classifiers available in many data mining applications. However, most ensembles developed specifically for the dynamic data stream setting rely…

0 datasets, 0 tasks, 0 flows, 0 runs

Collaborative, reproducible benchmarking and analysis

Benchmarking in Machine Learning is often much more difficult than it seems, and hard to reproduce. This study is a new approach to do a collaborative, in-depth benchmarking of algorithms, and allows…

0 datasets, 0 tasks, 0 flows, 0 runs

Heterogeneous Ensembles for Data Streams

Ensembles of classifiers are among the best performing classifiers available in many data mining applications. Rather than training one classifier, multiple classifiers are trained, and their…

0 datasets, 0 tasks, 0 flows, 0 runs

Subgroup Discovery

A subgroup discovery study.

0 datasets, 0 tasks, 0 flows, 0 runs

Data Streams and more

this study joins multiple data stream studies

0 datasets, 0 tasks, 0 flows, 0 runs

Massively Collaborative Machine Learning

All datasets, tasks, flows and setups used for Chapter 6 in the PhD Thesis "Massively Collaborative Machine Learning"

0 datasets, 0 tasks, 0 flows, 0 runs

Speeding up Algorithm Selection via Meta-learning and Active Testing

Authors: Salisu Mamman Abdulrahman, Pavel Brazdil, Jan N. van Rijn, Joaquin Vanschoren Abstract: Algorithm selection methods can be speeded-up substantially by incorporating multi-objective measures…

0 datasets, 0 tasks, 0 flows, 0 runs

ASLib OpenML Scenario

Containing all datasets, tasks, flows and runs used in the ASLib OpenML Scenario.

0 datasets, 0 tasks, 0 flows, 0 runs

Hyperparameter Importance Across Datasets

With the advent of automated machine learning, automated hyperparameter optimization methods are by now routinely used. However, this progress is not yet matched by equal progress on automatic…

0 datasets, 0 tasks, 0 flows, 0 runs

Linear vs. Non Linear

Comparison of linear and non-linear models. [Jupyter Notebook](https://github.com/janvanrijn/linear-vs-non-linear/blob/master/notebook/Linear-vs-Non-Linear.ipynb)

0 datasets, 0 tasks, 0 flows, 0 runs

OpenML100-friendly

Subset of the OpenML100, with datasets that are friedly towards scikit-learn algorithms (no Imputation or One-hot-encoding necessary)

0 datasets, 0 tasks, 0 flows, 0 runs

Does Feature Selection Improve Classification?

Feature selection can be of value to classification for a variety of reasons. Real world data sets can be rife with irrelevant features, especially if the data was not gather specifically for the…

394 datasets, 394 tasks, 24 flows, 9454 runs

OpenML-CC18 Curated Classification benchmark

We advocate the use of curated, comprehensive benchmark suites of machine learning datasets, backed by standardized OpenML-based interfaces and complementary software toolkits written in Python, Java…

72 datasets, 72 tasks, 0 flows, 0 runs

Forex

Contains currency trading tasks, for various valuta pairs.

192 datasets, 192 tasks, 0 flows, 0 runs