4.7 Article

RankRC: Large-Scale Nonlinear Rare Class Ranking

Journal

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
Volume 27, Issue 12, Pages 3347-3359

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TKDE.2015.2453171

Keywords

Machine learning; kernel-based learning; imbalanced classification; ranking loss; large-scale algorithms

Funding

  1. National Sciences and Engineering Research Council of Canada
  2. Ophelia Lazaridis University Research Chair

Ask authors/readers for more resources

Rare class problems are common in real-world applications across a wide range of domains. Standard classification algorithms are known to perform poorly in these cases, since they focus on overall classification accuracy. In addition, we have seen a significant increase of data in recent years, resulting in many large scale rare class problems. In this paper, we focus on nonlinear kernel based classification methods expressed as a regularized loss minimization problem. We address the challenges associated with both rare class problems and large scale learning, by 1) optimizing area under curve of the receiver of operator characteristic in the training process, instead of classification accuracy and 2) using a rare class kernel representation to achieve an efficient time and space algorithm. We call the algorithm RankRC. We provide justifications for the rare class representation and experimentally illustrate the effectiveness of RankRC in test performance, computational complexity, and model robustness.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available