Article

A Generic Coalescent-based Framework for the Selection of a Reference Panel for Imputation

GENETIC EPIDEMIOLOGY (2010)

Journal

GENETIC EPIDEMIOLOGY

Volume 34, Issue 8, Pages 773-782

Publisher

WILEY

DOI: 10.1002/gepi.20505

Keywords

genotype imputation; coalescent; GWAS; linkage disequilibrium; weighted panel

Categories

Genetics & Heredity; Mathematical & Computational Biology

Funding

National Science Foundation [IIS-071325412]
Israel Science Foundation [04514831]

Abstract

An important component in the analysis of genome-wide association studies is the imputation of genotypes that have not been measured directly in the studied samples. The imputation procedure uses the linkage disequilibrium (LD) structure in the population to infer the genotype of an unobserved single nucleotide polymorphism. The LD structure is normally learned from a dense genotype map of a reference population that matches the studied population. In many instances no reference population exactly matches the studied population, and a natural question arises as to how to choose the reference population for the imputation. Here we present a coalescent-based method that addresses this issue. In contrast to the current paradigm of imputation methods, our method assigns a different reference dataset to each sample in the studied population and to each region in the genome. This allows the flexibility to account for diversity within populations as well as across populations. Furthermore, because our approach treats each region in the genome separately, it is suitable for the imputation of recently admixed populations. We evaluated our method across a large set of populations and found that our choice of reference dataset considerably improves the accuracy of imputation, especially for regions with low LD, for populations without an available reference population, and for admixed populations such as the Hispanic population. Our method is generic and can potentially be incorporated into any of the available imputation methods as an add-on. Genet. Epidemiol. 34:773-782, 2010. (C) 2010 Wiley-Liss, Inc.
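
The idea of choosing a separate reference set for each sample and each genomic region can be illustrated with a minimal sketch. The abstract does not describe the coalescent-based model itself, so the sketch below substitutes a plain Hamming-distance ranking as a stand-in; the function names, the fixed `region_size` windows, the cutoff `k`, and the toy haplotypes are all assumptions made for illustration and are not taken from the paper.

```python
from typing import List, Sequence, Tuple

Haplotype = Sequence[int]  # 0/1 alleles at the typed SNPs, in genomic order


def hamming(a: Haplotype, b: Haplotype) -> int:
    """Count the positions at which two equal-length segments differ."""
    return sum(x != y for x, y in zip(a, b))


def select_panel_per_region(
    sample: Haplotype,
    reference: List[Tuple[str, Haplotype]],
    region_size: int,
    k: int,
) -> List[List[str]]:
    """For each fixed-size region, return the labels of the k reference
    haplotypes closest to the sample within that region.

    NOTE: Hamming distance is an illustrative stand-in, not the
    coalescent-based criterion used in the paper.
    """
    panels = []
    for start in range(0, len(sample), region_size):
        end = start + region_size
        segment = sample[start:end]
        # sorted() is stable, so ties keep the input order of `reference`.
        ranked = sorted(
            reference,
            key=lambda item: hamming(segment, item[1][start:end]),
        )
        panels.append([label for label, _ in ranked[:k]])
    return panels


# Hypothetical toy data: the sample resembles population A in the first
# region and population B in the second, as an admixed genome might.
reference = [
    ("A1", [0, 0, 0, 0, 0, 0, 0, 0]),
    ("A2", [0, 0, 0, 1, 0, 0, 0, 1]),
    ("B1", [1, 1, 1, 1, 1, 1, 1, 1]),
    ("B2", [1, 1, 1, 0, 1, 1, 1, 0]),
]
sample = [0, 0, 0, 0, 1, 1, 1, 1]

panels = select_panel_per_region(sample, reference, region_size=4, k=2)
print(panels)  # [['A1', 'A2'], ['B1', 'B2']]
```

Because the selection is repeated per region, the toy sample draws its reference haplotypes from population A in the first window and from population B in the second, which is the behaviour the abstract describes as making the approach suitable for recently admixed populations.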

Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank

Robin J. Hofmeister, Diogo M. Ribeiro, Simone Rubinacci, Olivier Delaneau

Summary: SHAPEIT5, a new phasing method, accurately processes large sequencing datasets and improves imputation accuracy by generating reference panels of haplotypes. The method was applied to UK Biobank data, which resulted in the identification of 549 genes with compound heterozygous loss-of-function events. The use of UK Biobank as a reference panel, coupled with SHAPEIT5 phasing, further enhances genotype imputation accuracy.

NATURE GENETICS (2023)