4.5 Article

Graph-based local concept coordinate factorization

Journal

KNOWLEDGE AND INFORMATION SYSTEMS
Volume 43, Issue 1, Pages 103-126

Publisher

SPRINGER LONDON LTD
DOI: 10.1007/s10115-013-0715-x

Keywords

Manifold kernel learning; Local coordinate coding; Graph Laplacian; Concept factorization; Clustering

Funding

  1. National Natural Science Foundation of China [91120302, 61222207, 61173185, 61173186]
  2. National Basic Research Program of China (973 Program) [2013CB336500]
  3. Fundamental Research Funds for the Central Universities [2012FZA5017]
  4. Zhejiang Province Key S&T Innovation Group Project [2009R50009]

Ask authors/readers for more resources

Ubiquitous data are increasingly expanding in large volumes due to human activities, and grouping them into appropriate clusters is an important and yet challenging problem. Existing matrix factorization techniques have shown their significant power in solving this problem, e.g., nonnegative matrix factorization, concept factorization. Recently, one state-of-the-art method called locality-constrained concept factorization is put forward, but its locality constraint does not well reveal the intrinsic data structure since it only requires the concept to be as close to the original data points as possible. To address this issue, we present a graph-based local concept coordinate factorization (GLCF) method, which respects the intrinsic structure of the data through manifold kernel learning in the warped Reproducing Kernel Hilbert Space. Besides, a generalized update algorithm is developed to handle data matrices containing both positive and negative entries. Since GLCF is essentially based on the local coordinate coding and concept factorization, it inherits many advantageous properties, such as the locality and sparsity of the data representation. Moreover, it can better encode the locally geometrical structure via graph Laplacian in the manifold adaptive kernel. Therefore, a more compact and better structured representation can be obtained in the low-dimensional data space. Extensive experiments on several image and gene expression databases suggest the superiority of the proposed method in comparison with some alternatives.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available