Article

Enhanced Localization of Genetic Samples through Linkage-Disequilibrium Correction

Journal

AMERICAN JOURNAL OF HUMAN GENETICS
Volume 92, Issue 6, Pages 882-894

Publisher

CELL PRESS
DOI: 10.1016/j.ajhg.2013.04.023

Funding

  1. National Cancer Institute of the National Institutes of Health [R03-CA162200]
  2. German-Israeli Foundation [109433.2/2010]
  3. Israeli Science Foundation [04514831]
  4. Edmond J. Safra Center for Bioinformatics at Tel-Aviv University
  5. National Science Foundation [III-1217615]
  6. National Science Foundation, Directorate for Computer & Information Science & Engineering
  7. National Science Foundation, Division of Information & Intelligent Systems [1217615]

Characterizing the spatial patterns of genetic diversity in human populations has a wide range of applications, from detecting genetic mutations associated with disease to inferring human history. Current approaches, including the widely used principal-component analysis, are not suited for the analysis of linked markers, and local and long-range linkage disequilibrium (LD) can dramatically reduce the accuracy of spatial localization when unaccounted for. To overcome this, we have introduced an approach that performs spatial localization of individuals on the basis of their genetic data and explicitly models LD among markers by using a multivariate normal distribution. By leveraging external reference panels, we derive closed-form solutions to the optimization procedure to achieve a computationally efficient method that can handle large data sets. We validate the method on empirical data from a large sample of European individuals from the POPRES data set, as well as on a large sample of individuals of Spanish ancestry. First, we show that by modeling LD, we achieve accuracy superior to that of existing methods. Importantly, whereas other methods show decreased performance when dense marker panels are used in the inference, our approach improves in accuracy as more markers become available. Second, we show that accurate localization of genetic data can be achieved with only a part of the genome, and this could potentially enable the spatial localization of admixed samples that have a fraction of their genome originating from a given continent. Finally, we demonstrate that our approach is resistant to distortions resulting from long-range LD regions; such distortions can dramatically bias the results when unaccounted for.
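
The idea of localizing a sample under a multivariate normal model with an LD covariance can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes, purely for illustration, that the expected genotype at each marker varies linearly with the two geographic coordinates, in which case maximizing the multivariate normal likelihood reduces to a closed-form generalized least-squares estimate. The function name `localize_gls`, its parameters, and the `ridge` stabilizer are hypothetical and do not come from the paper.

```python
import numpy as np

def localize_gls(genotypes, intercept, slopes, ld_cov, ridge=1e-6):
    """Estimate 2-D coordinates by generalized least squares (illustrative).

    Assumed model (hypothetical, not taken from the paper):
        genotypes ~ Normal(intercept + slopes @ xy, ld_cov)
    genotypes : (m,)   allele counts for one individual
    intercept : (m,)   expected genotype at the coordinate origin
    slopes    : (m, 2) change in expected genotype per unit coordinate
    ld_cov    : (m, m) marker covariance, e.g. estimated from a reference panel
    """
    m = len(genotypes)
    # A small ridge keeps the covariance invertible when markers are in strong LD.
    cov = ld_cov + ridge * np.eye(m)
    resid = genotypes - intercept
    # Solve linear systems instead of forming the inverse explicitly.
    w_slopes = np.linalg.solve(cov, slopes)   # cov^-1 @ slopes, shape (m, 2)
    lhs = slopes.T @ w_slopes                 # slopes^T cov^-1 slopes, shape (2, 2)
    rhs = w_slopes.T @ resid                  # slopes^T cov^-1 resid (cov is symmetric)
    return np.linalg.solve(lhs, rhs)

# Synthetic demonstration with made-up numbers.
rng = np.random.default_rng(0)
m = 30
slopes = rng.normal(size=(m, 2))
intercept = rng.uniform(0.2, 1.8, size=m)
mix = rng.normal(size=(m, m))
ld_cov = mix @ mix.T / m + np.eye(m)          # symmetric positive-definite covariance
true_xy = np.array([0.5, -0.8])
genotypes = intercept + slopes @ true_xy      # noiseless, so the estimate should match
print(localize_gls(genotypes, intercept, slopes, ld_cov))
```

Replacing `ld_cov` with an identity matrix treats the markers as independent, which is the simplification that the abstract says reduces accuracy when markers are linked.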
