4.5 Review

A survey on data stream clustering and classification

Journal

KNOWLEDGE AND INFORMATION SYSTEMS
Volume 45, Issue 3, Pages 535-569

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s10115-014-0808-1

Keywords

Data stream mining; Clustering; Classification; Survey

Funding

  1. Wattalyzer grant project of I2R, Singapore [NRF2012EWT-EIRP002-044]

With advances in technology, many applications now generate huge volumes of streaming data at very high speed. Examples include network traffic, web click streams, video surveillance, and sensor networks. Data stream mining, whose goal is to extract hidden knowledge and patterns from continuous data streams, has become an active research topic. Unlike traditional data mining, where the dataset is static and can be read repeatedly, data stream mining algorithms must satisfy constraints such as bounded memory, single-pass processing, real-time response, and concept-drift detection. This paper presents a comprehensive survey of state-of-the-art data stream mining algorithms, focusing on clustering and classification because of their ubiquitous use. It identifies mining constraints, proposes a general model for data stream mining, and describes the relationship between traditional data mining and data stream mining. Furthermore, it analyzes the advantages and limitations of data stream algorithms and suggests potential areas for future research.

Reviews

Primary Rating

4.5
Not enough ratings
