4.6 Article

Accurate Computation of Survival Statistics in Genome-Wide Studies

期刊

PLOS COMPUTATIONAL BIOLOGY
卷 11, 期 5, 页码 -

出版社

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.1004071

关键词

-

资金

  1. NSF [IIS-1016648, IIS-1247581]
  2. NIH [R01HG007069]
  3. Burroughs Wellcome Fund
  4. Alfred P. Sloan Research Fellowship
  5. NSF CAREER [CCF-1053753]
  6. Direct For Computer & Info Scie & Enginr
  7. Division of Computing and Communication Foundations [1053753] Funding Source: National Science Foundation
  8. Direct For Computer & Info Scie & Enginr
  9. Div Of Information & Intelligent Systems [1016648, 1247581] Funding Source: National Science Foundation
  10. Div Of Information & Intelligent Systems
  11. Direct For Computer & Info Scie & Enginr [GRANTS:13980202] Funding Source: National Science Foundation

向作者/读者索取更多资源

A key challenge in genomics is to identify genetic variants that distinguish patients with different survival time following diagnosis or treatment. While the log-rank test is widely used for this purpose, nearly all implementations of the log-rank test rely on an asymptotic approximation that is not appropriate in many genomics applications. This is because: the two populations determined by a genetic variant may have very different sizes; and the evaluation of many possible variants demands highly accurate computation of very small p-values. We demonstrate this problem for cancer genomics data where the standard log-rank test leads to many false positive associations between somatic mutations and survival time. We develop and analyze a novel algorithm, Exact Log-rank Test (ExaLT), that accurately computes the p-value of the log-rank statistic under an exact distribution that is appropriate for any size populations. We demonstrate the advantages of ExaLT on data from published cancer genomics studies, finding significant differences from the reported p-values. We analyze somatic mutations in six cancer types from The Cancer Genome Atlas (TCGA), finding mutations with known association to survival as well as several novel associations. In contrast, standard implementations of the log-rank test report dozens-hundreds of likely false positive associations as more significant than these known associations.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据