4.0 Article

Efficient Association Study Design Via Power-Optimized Tag SNP Selection

期刊

ANNALS OF HUMAN GENETICS
卷 72, 期 -, 页码 834-847

出版社

WILEY
DOI: 10.1111/j.1469-1809.2008.00469.x

关键词

association study; tag SNP selection; statistical power; single nucleotide polymorphism; linkage disequilibrium

资金

  1. Microsoft Graduate Research Fellowship
  2. National Science Foundation [0513612, 0731455]
  3. National Institutes of Health [1K25HL080079]
  4. Research Facilities Improvement Program [C06 RR017588]
  5. Biomedical Engineering Institute, and the Biomedical Technology Resource Centers Program [P41 RR08605]
  6. National Biomedical Computation Resource, UCSD
  7. National Center for Research Resources, National Institutes of Health
  8. California Institute of Telecommunications and Information Technology
  9. UCSD FWGrid
  10. NSF Research Infrastructure [EIA-0303622]
  11. Direct For Computer & Info Scie & Enginr [0731455] Funding Source: National Science Foundation
  12. Div Of Information & Intelligent Systems [0731455] Funding Source: National Science Foundation

向作者/读者索取更多资源

Discovering statistical correlation between causal genetic variation and clinical traits through association studies is an important method for identifying the genetic basis of human diseases. Since fully resequencing a cohort is prohibitively costly, genetic association studies take advantage of local correlation structure (or linkage disequilibrium) between single nucleotide polymorphisms (SNPs) by selecting a subset of SNPs to be genotyped (tag SNPs). While many current association studies are performed using commercially available high-throughput genotyping products that define a set of tag SNPs, choosing tag SNPs remains an important problem for both custom follow-up studies as well as designing the high-throughput genotyping products themselves. The most widely used tag SNP selection method optimizes the correlation between SNPs (r(2)). However, tag SNPs chosen based on an r(2) criterion do not necessarily maximize the statistical power of an association study. We propose a study design framework that chooses SNPs to maximize power and efficiently measures the power through empirical simulation. Empirical results based on the HapMap data show that our method gains considerable power over a widely used r(2)-based method, or equivalently reduces the number of tag SNPs required to attain the desired power of a study. Our power-optimized 100k whole genome tag set provides equivalent power to the Affymetrix 500k chip for the CEU population. For the design of custom follow-up studies, our method provides up to twice the power increase using the same number of tag SNPs as r(2)-based methods. Our method is publicly available via web server at external link type http://design.cs.ucla.edu.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据