☆ 4.6 Article

Exploit latent Dirichlet allocation for collaborative filtering

FRONTIERS OF COMPUTER SCIENCE (2018)

Journal

FRONTIERS OF COMPUTER SCIENCE

Volume 12, Issue 3, Pages 571-581

Publisher

HIGHER EDUCATION PRESS

DOI: 10.1007/s11704-016-6078-1

Keywords

latent Dirichlet allocation; one-class collaborative filtering; multi-class collaborative filtering

Funding

National Natural Science Foundation of China (NSFC) [61370126, 61672081, 71540028, 61571052, 61602237]
National High-tech R&D Program of China [2015AA016004]
Beijing Advanced Innovation Center for Imaging Technology [BAICIT-2016001]
Fund of the State Key Laboratory of Software Development Environment [SKLSDE-2013ZX-19]
Fund of Beijing Social Science [14JGC103]
Statistics Research Project of National Bureau [2013LY055]
Fund of Beijing Wuzi University, China [GJB20141002]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Abstract

Previous work on the one-class collaborative filtering (OCCF) problem can be roughly categorized into pointwise methods, pairwise methods, and content-based methods. A fundamental assumption of these approaches is that all missing values in the user-item rating matrix are considered negative. However, this assumption may not hold because the missing values may contain negative and positive examples. For example, a user who fails to give positive feedback about an item may not necessarily dislike it; he may simply be unfamiliar with it. Meanwhile, content-based methods, e.g. collaborative topic regression (CTR), usually require textual content information of the items, and thus their applicability is largely limited when the text information is not available. In this paper, we propose to apply the latent Dirichlet allocation (LDA) model on OCCF to address the above-mentioned problems. The basic idea of this approach is that items are regarded as words, users are considered as documents, and the user-item feedback matrix constitutes the corpus. Our model drops the strong assumption that missing values are all negative and only utilizes the observed data to predict a user's interest. Additionally, the proposed model does not need content information of the items. Experimental results indicate that the proposed method outperforms previous methods on various ranking-oriented evaluation metrics. We further combine this method with a matrix factorization-based method to tackle the multi-class collaborative filtering (MCCF) problem, which also achieves better performance on predicting user ratings.

Exploit latent Dirichlet allocation for collaborative filtering

Journal

FRONTIERS OF COMPUTER SCIENCE

Publisher

HIGHER EDUCATION PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Exploit latent Dirichlet allocation for collaborative filtering

Journal

FRONTIERS OF COMPUTER SCIENCE

Publisher

HIGHER EDUCATION PRESS

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper