Article

SAIGEgds-an efficient statistical tool for large-scale PheWAS with mixed models

BIOINFORMATICS (2021)

Journal

BIOINFORMATICS

Volume 37, Issue 5, Pages 728-730

Publisher

OXFORD UNIV PRESS

DOI: 10.1093/bioinformatics/btaa731

Categories

Biochemical Research Methods; Biotechnology & Applied Microbiology; Computer Science, Interdisciplinary Applications; Mathematical & Computational Biology; Statistics & Probability

Funding

AbbVie

Summary

Phenome-wide association studies (PheWASs) are powerful tools for discovering and replicating genetic associations. Methods such as SAIGE reduce their computational burden in large cohorts, but analyzing thousands of phenotypes with whole-genome imputed data remains computationally intensive. The SAIGEgds package offers a faster alternative, especially when used with high-performance computing clusters.

Abstract

Phenome-wide association studies (PheWASs) are known to be a powerful tool for the discovery and replication of genetic associations. To reduce the computational burden of PheWAS in large cohorts, such as the UK Biobank, the SAIGE method has been proposed to control for case-control imbalance and sample relatedness in a tractable manner. However, SAIGE is still computationally intensive when used to analyze the associations of thousands of ICD10-coded phenotypes with whole-genome imputed genotype data. Here, we present a new high-performance statistical R package (SAIGEgds) for large-scale PheWAS using generalized linear mixed models. The package implements the SAIGE method in optimized C++ code, taking advantage of sparse genotype dosages and integrating the efficient genomic data structure (GDS) file format. Benchmarks using the UK Biobank White British genotype data (N ≈ 430K) with coronary heart disease and simulated cases show that the implementation in SAIGEgds is 5-6 times faster than the SAIGE R package. When used in conjunction with high-performance computing clusters, SAIGEgds provides an efficient analysis pipeline for biobank-scale PheWAS.
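
The abstract credits part of the speedup to sparse genotype dosages. The Python sketch below illustrates that general idea only; it is not the SAIGEgds implementation, which is written in C++ and fits a mixed model. It assumes a simulated low-frequency variant and an intercept-only null model, so the fitted mean `mu` is simply the case fraction. The function names, sample size and allele frequency are all hypothetical. It computes the score numerator S = Σ g_i (y_i − μ_i) twice: once over every sample, and once over only the samples that carry a nonzero dosage.

```python
import numpy as np

def score_dense(dosage, residual):
    """Score numerator S = sum_i g_i * (y_i - mu_i), visiting every sample."""
    return float(np.dot(dosage, residual))

def score_sparse(dosage, residual):
    """Same quantity, but only samples with a nonzero dosage contribute."""
    idx = np.flatnonzero(dosage)  # indices of carriers
    return float(np.dot(dosage[idx], residual[idx]))

# Simulated data (all values are illustrative assumptions).
rng = np.random.default_rng(0)
n = 10_000                     # number of samples
maf = 0.01                     # minor allele frequency of the variant
dosage = rng.binomial(2, maf, size=n).astype(float)
y = rng.binomial(1, 0.05, size=n).astype(float)  # imbalanced case-control status

# Assumption: intercept-only null model, so mu is the overall case fraction.
mu = np.full(n, y.mean())
residual = y - mu

s_dense = score_dense(dosage, residual)
s_sparse = score_sparse(dosage, residual)
nonzero_fraction = np.count_nonzero(dosage) / n

print(f"dense  score: {s_dense:.6f}")
print(f"sparse score: {s_sparse:.6f}")
print(f"fraction of samples with nonzero dosage: {nonzero_fraction:.4f}")
```

In this sketch `np.flatnonzero` still scans the full dosage vector. A real implementation would typically store the nonzero indices once per variant and reuse them across phenotypes, which is where the saving matters when thousands of phenotypes are tested against the same genotypes.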
