☆ 4.5 Article

Sparse Biclustering of Transposable Data

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS (2014)

期刊

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS

卷 23, 期 4, 页码 985-1008

出版社

AMER STATISTICAL ASSOC

DOI: 10.1080/10618600.2013.852554

关键词

l(1) penalty; Gene expression; Unsupervised learning; Matrix-variate normal distribution; Clustering

类别

Statistics & Probability

资金

NIH [DP5OD009145]
NSF CAREER [DMS-1252624]

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

We consider the task of simultaneously clustering the rows and columns of a large transposable data matrix. We assume that the matrix elements are normally distributed with a bicluster-specific mean term and a common variance, and perform biclustering by maximizing the corresponding log-likelihood. We apply an l(1) penalty to the means of the biclusters to obtain sparse and interpretable biclusters. Our proposal amounts to a sparse, symmetrized version of k-means clustering. We show that k-means clustering of the rows and of the columns of a data matrix can be seen as special cases of our proposal, and that a relaxation of our proposal yields the singular value decomposition. In addition, we propose a framework for biclustering based on the matrix-variate normal distribution. The performances of our proposals are demonstrated in a simulation study and on a gene expression dataset. This article has supplementary material online.

Sparse Biclustering of Transposable Data

期刊

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS

出版社

AMER STATISTICAL ASSOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Sparse Biclustering of Transposable Data

期刊

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS

出版社

AMER STATISTICAL ASSOC

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文