4.6 Article

Right singular vector projection graphs: fast high dimensional covariance matrix estimation under latent confounding

出版社

WILEY
DOI: 10.1111/rssb.12359

关键词

Causal structure learning; Covariance matrix; Graphical models; High dimensional data; Latent confounding

资金

  1. Engineering and Physical Sciences Research Council 'First' grant [EP/R013381/1]
  2. Alan Turing Institute under Engineering and Physical Sciences Research Council [EP/N510129/1]
  3. EPSRC [EP/R013381/1] Funding Source: UKRI

向作者/读者索取更多资源

We consider the problem of estimating a high dimensional pxp covariance matrix sigma, given n observations of confounded data with covariance sigma+Gamma Gamma T, where Gamma is an unknown pxq matrix of latent factor loadings. We propose a simple and scalable estimator based on the projection onto the right singular vectors of the observed data matrix, which we call right singular vector projection (RSVP). Our theoretical analysis of this method reveals that, in contrast with approaches based on the removal of principal components, RSVP can cope well with settings where the smallest eigenvalue of Gamma T Gamma is relatively close to the largest eigenvalue of sigma, as well as when the eigenvalues of Gamma T Gamma are diverging fast. RSVP does not require knowledge or estimation of the number of latent factors q, but it recovers sigma only up to an unknown positive scale factor. We argue that this suffices in many applications, e.g. if an estimate of the correlation matrix is desired. We also show that, by using subsampling, we can further improve the performance of the method. We demonstrate the favourable performance of RSVP through simulation experiments and an analysis of gene expression data sets collated by the GTEX consortium.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据