4.5 Article

On the equivalence between Non-negative Matrix Factorization and Probabilistic Latent Semantic Indexing

期刊

COMPUTATIONAL STATISTICS & DATA ANALYSIS
卷 52, 期 8, 页码 3913-3927

出版社

ELSEVIER
DOI: 10.1016/j.csda.2008.01.011

关键词

-

资金

  1. Division of Computing and Communication Foundations
  2. Direct For Computer & Info Scie & Enginr [0830780] Funding Source: National Science Foundation

向作者/读者索取更多资源

Non-negative Matrix Factorization (NMF) and Probabilistic Latent Semantic Indexing (PLSI) have been successfully applied to document clustering recently. In this paper, we show that PLSI and NMF (with the I-divergence objective function) optimize the same objective function, although PLSI and NMF are different algorithms as verified by experiments. This provides a theoretical basis for a new hybrid method that runs PLSI and NMF alternatively, each jumping out of the local minima of the other method successively, thus achieving a better final solution. Extensive experiments on five real-life datasets show relations between NMF and PLSI, and indicate that the hybrid method leads to significant improvements over NMF-only or PLSI-only methods. We also show that at first-order approximation, NMF is identical to the X-2-statistic. (c) 2008 Published by Elsevier B.V.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据