January 1, 2009

Mining Multi-label Concept-Drifting Streams Using Ensemble Classifiers

Abstract

The problem of mining single-label data streams has been extensively studied in recent years. However, not enough attention has been paid to the problem of mining multi-label data streams. In this paper, a weighted voting ensemble approach is proposed to tackle this problem. We partition the incoming data stream into sequential chunks and use the binary relevance method to transform each chunk into a set of single-label chunks, each of which can be learned by a binary classification algorithm. We train an ensemble of classifiers on the transformed chunks, and the classifiers in the ensemble are weighted by their expected classification accuracy on the test data in the time-evolving environment. We also propose a method for simulating multi-label data streams with concept drift. Our empirical study on a synthetic data set shows that the proposed approach has a substantial advantage over the majority voting ensemble approach.
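The pipeline described in the abstract (chunking, binary relevance, per-chunk training, weighted voting) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The base learner `CentroidClassifier`, the helper names, and the choice to use accuracy on the most recent chunk as a proxy for "expected classification accuracy" are all assumptions made for illustration.

```python
from typing import List, Sequence, Set, Tuple

# (feature vector, set of relevant label indices)
Example = Tuple[Sequence[float], Set[int]]


def binary_relevance(chunk: List[Example], n_labels: int):
    """Turn one multi-label chunk into n_labels single-label (binary) chunks."""
    return [[(x, int(j in labels)) for x, labels in chunk] for j in range(n_labels)]


class CentroidClassifier:
    """Hypothetical stand-in base learner: predicts the nearest class centroid."""

    def fit(self, data):
        self.centroids = {}
        for cls in (0, 1):
            rows = [x for x, y in data if y == cls]
            if rows:  # a class may be absent from a chunk
                self.centroids[cls] = [sum(col) / len(rows) for col in zip(*rows)]
        return self

    def predict(self, x):
        def sq_dist(c):
            return sum((a - b) ** 2 for a, b in zip(x, c))
        return min(self.centroids, key=lambda cls: sq_dist(self.centroids[cls]))


def train_member(chunk: List[Example], n_labels: int):
    """One ensemble member: one binary classifier per label, trained on one chunk."""
    return [CentroidClassifier().fit(d) for d in binary_relevance(chunk, n_labels)]


def member_weights(member, recent_chunk: List[Example], n_labels: int):
    """Assumption: weight = per-label accuracy on the most recent chunk."""
    weights = []
    for j, data in enumerate(binary_relevance(recent_chunk, n_labels)):
        correct = sum(member[j].predict(x) == y for x, y in data)
        weights.append(correct / len(data))
    return weights


def weighted_vote(ensemble, weights, x, n_labels: int) -> Set[int]:
    """Each member votes +w (label relevant) or -w (not relevant) per label."""
    predicted = set()
    for j in range(n_labels):
        score = sum(
            w[j] * (1 if m[j].predict(x) == 1 else -1)
            for m, w in zip(ensemble, weights)
        )
        if score > 0:
            predicted.add(j)
    return predicted


# Toy stream with two chunks; the concept flips between them (concept drift).
old_chunk = [([0.1], {0}), ([0.2], {0}), ([0.8], {1}), ([0.9], {1})]
new_chunk = [([0.1], {1}), ([0.2], {1}), ([0.8], {0}), ([0.9], {0})]

ensemble = [train_member(old_chunk, 2), train_member(new_chunk, 2)]
weights = [member_weights(m, new_chunk, 2) for m in ensemble]
print(weights)
print(weighted_vote(ensemble, weights, [0.85], 2))
```

In this toy stream the member trained on the outdated chunk receives zero weight, so the prediction follows the current concept. An unweighted majority vote over the same two members would be tied on every label.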
