4.5 Article

Sparse Biclustering of Transposable Data

期刊

出版社

AMER STATISTICAL ASSOC
DOI: 10.1080/10618600.2013.852554

关键词

l(1) penalty; Gene expression; Unsupervised learning; Matrix-variate normal distribution; Clustering

资金

  1. NIH [DP5OD009145]
  2. NSF CAREER [DMS-1252624]

向作者/读者索取更多资源

We consider the task of simultaneously clustering the rows and columns of a large transposable data matrix. We assume that the matrix elements are normally distributed with a bicluster-specific mean term and a common variance, and perform biclustering by maximizing the corresponding log-likelihood. We apply an l(1) penalty to the means of the biclusters to obtain sparse and interpretable biclusters. Our proposal amounts to a sparse, symmetrized version of k-means clustering. We show that k-means clustering of the rows and of the columns of a data matrix can be seen as special cases of our proposal, and that a relaxation of our proposal yields the singular value decomposition. In addition, we propose a framework for biclustering based on the matrix-variate normal distribution. The performances of our proposals are demonstrated in a simulation study and on a gene expression dataset. This article has supplementary material online.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据