☆ 4.6 Article

Semi-Supervised Text Classification With Universum Learning

IEEE TRANSACTIONS ON CYBERNETICS (2016)

Journal

IEEE TRANSACTIONS ON CYBERNETICS

Volume 46, Issue 2, Pages 462-473

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

DOI: 10.1109/TCYB.2015.2403573

Keywords

AdaBoost; learning with Universum; text classification

Funding

National Science Council [NSC-103-2221-E-009-153]
Aim for Top University Project of National Taiwan Normal University
Ministry of Education, Taiwan

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Universum, a collection of nonexamples that do not belong to any class of interest, has become a new research topic in machine learning. This paper devises a semi-supervised learning with Universum algorithm based on boosting technique, and focuses on situations where only a few labeled examples are available. We also show that the training error of AdaBoost with Universum is bounded by the product of normalization factor, and the training error drops exponentially fast when each weak classifier is slightly better than random guessing. Finally, the experiments use four data sets with several combinations. Experimental results indicate that the proposed algorithm can benefit from Universum examples and outperform several alternative methods, particularly when insufficient labeled examples are available. When the number of labeled examples is insufficient to estimate the parameters of classification functions, the Universum can be used to approximate the prior distribution of the classification functions. The experimental results can be explained using the concept of Universum introduced by Vapnik, that is, Universum examples implicitly specify a prior distribution on the set of classification functions.

Semi-Supervised Text Classification With Universum Learning

Journal

IEEE TRANSACTIONS ON CYBERNETICS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Semi-Supervised Text Classification With Universum Learning

Journal

IEEE TRANSACTIONS ON CYBERNETICS

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper