4.5 Article

Multi-window based ensemble learning for classification of imbalanced streaming data

Journal

WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS
Volume 20, Issue 6, Pages 1507-1525

Publisher

SPRINGER
DOI: 10.1007/s11280-017-0449-x

Keywords

Streaming data; Class imbalance; Multi-window; Ensemble learning

Funding

  1. ARC DP project [DP 130101327]
  2. 973 Program [2013CB329601, 2013CB329602, 2013CB329604]
  3. 863 Program [2012AA01A401, 2012AA01A402]

Ask authors/readers for more resources

Imbalanced streaming data is commonly encountered in real-world data mining and machine learning applications, and has attracted much attention in recent years. Both imbalanced data and streaming data in practice are normally encountered together; however, little research work has been studied on the two types of data together. In this paper, we propose a multi-window based ensemble learning method for the classification of imbalanced streaming data. Three types of windows are defined to store the current batch of instances, the latest minority instances, and the ensemble classifier. The ensemble classifier consists of a set of latest sub-classifiers, and the instances employed to train each sub-classifier. All sub-classifiers are weighted prior to predicting the class labels of newly arriving instances, and new sub-classifiers are trained only when the precision is below a predefined threshold. Extensive experiments on synthetic datasets and real-world datasets demonstrate that the new approach can efficiently and effectively classify imbalanced streaming data, and generally outperforms existing approaches.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available