☆ 4.3 Article

Efficient approaches for large-scale GWAS with genotype uncertainty

G3-GENES GENOMES GENETICS (2022)

Journal

G3-GENES GENOMES GENETICS

Volume 12, Issue 1, Pages -

Publisher

OXFORD UNIV PRESS INC

DOI: 10.1093/g3journal/jkab385

Keywords

admixture; association mapping; case-control study; next-generation sequencing; quantitative traits

Funding

Lundbeck Foundation [R215-2015-4174]

Ask authors/readers for more resources

Protocol

Community support

Reagent

Community support

Automated Summary New
Abstract

Association studies using genetic data from SNP-chip-based imputation or low-depth sequencing data are cost-efficient for large-scale studies. The proposed method, ANGSD-asso's latent model, models the unobserved genotype as a latent variable in a generalized linear model framework. Using individual allele frequency prior has shown to have better statistical power in structured populations compared to sample allele frequency.

Association studies using genetic data from SNP-chip-based imputation or low-depth sequencing data provide a cost-efficient design for large-scale association studies. We explore methods for performing association studies applicable to such genetic data and investigate how using different priors when estimating genotype probabilities affects the association results. Our proposed method, ANGSD-asso's latent model, models the unobserved genotype as a latent variable in a generalized linear model framework. The software is implemented in C/C++ and can be run multi-threaded. ANGSD-asso is based on genotype probabilities, which can be estimated using either the sample allele frequency or the individual allele frequencies as a prior. We explore through simulations how genotype probability-based methods compare with using genetic dosages. Our simulations show that in a structured population using the individual allele frequency prior has better power than the sample allele frequency. In scenarios with sequencing depth and phenotype correlation ANGSD-asso's latent model has higher statistical power and less bias than using dosages. Adding additional covariates to the linear model of ANGSD-asso's latent model has higher statistical power and less bias than other methods that accommodate genotype uncertainty, while also being much faster. This is shown with imputed data from UK Biobank and simulations.

Efficient approaches for large-scale GWAS with genotype uncertainty

Journal

G3-GENES GENOMES GENETICS

Publisher

OXFORD UNIV PRESS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Efficient approaches for large-scale GWAS with genotype uncertainty

Journal

G3-GENES GENOMES GENETICS

Publisher

OXFORD UNIV PRESS INC

Keywords

Categories

Funding

Ask authors/readers for more resources

Protocol

Reagent

Authors

I am an author on this paper

Reviews

Primary Rating

Secondary Ratings

Novelty

Significance

Scientific rigor

Rate this paper

Recommended

Export Citation

Share Paper