4.5 Article

COE: A General Approach for Efficient Genome-Wide Two-Locus Epistasis Test in Disease Association Study

期刊

JOURNAL OF COMPUTATIONAL BIOLOGY
卷 17, 期 3, 页码 401-415

出版社

MARY ANN LIEBERT, INC
DOI: 10.1089/cmb.2009.0155

关键词

algorithms; computational molecular biology; dynamic programming; genomics; sequence analysis

资金

  1. Div Of Information & Intelligent Systems
  2. Direct For Computer & Info Scie & Enginr [812464] Funding Source: National Science Foundation

向作者/读者索取更多资源

The availability of high-density single nucleotide polymorphisms (SNPs) data has made genome-wide association study computationally challenging. Two-locus epistasis (gene-gene interaction) detection has attracted great research interest as a promising method for genetic analysis of complex diseases. In this article, we propose a general approach, COE, for efficient large scale gene-gene interaction analysis, which supports a wide range of tests. In particular, we show that many commonly used statistics are convex functions. From the observed values of the events in two-locus association test, we can develop an upper bound of the test value. Such an upper bound only depends on single-locus test and the genotype of the SNP-pair. We thus group and index SNP-pairs by their genotypes. This indexing structure can benefit the computation of all convex statistics. Utilizing the upper bound and the indexing structure, we can prune most of the SNP-pairs without compromising the optimality of the result. Our approach is especially efficient for large permutation test. Extensive experiments demonstrate that our approach provides orders of magnitude performance improvement over the brute force approach.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.5
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据