OpenML
Having a Blast: Meta-Learning and Heterogeneous Ensembles for Data Streams

Having a Blast: Meta-Learning and Heterogeneous Ensembles for Data Streams

Created 22-03-2016 by Jan van Rijn Visibility: public
Loading wiki
Ensembles of classifiers are among the best performing classifiers available in many data mining applications. However, most ensembles developed specifically for the dynamic data stream setting rely on only one type of base-level classifier, most often Hoeffding Trees. In this study, we investigate the use of heterogeneous ensembles, comprised of fundamentally different model types. Heterogeneous ensembles have proven successful in the classical batch data setting, however they do not easily transfer to the data stream setting. We therefore introduce the Online Performance Estimation framework, which can be used in data stream ensembles to weight the votes of (heterogeneous) ensemble members differently across the stream. Experiments over a wide range of data streams show performance that is competitive with state of the art ensemble techniques, including Online Bagging and Leveraging Bagging.