Article

Neutrality Tests for Sequences with Missing Data

Journal

GENETICS
Volume 191, Issue 4, Pages 1397-U511

Publisher

GENETICS SOC AM
DOI: 10.1534/genetics.112.139949

Funding

  1. Consolider grant [CSD2007-00036]
  2. Consejo Superior de Investigaciones Cientificas (Spain) under the JAE-doc program
  3. [CGL2009-09346]
  4. [AG2010-14822]

Missing data are common in DNA sequences obtained through high-throughput sequencing. Furthermore, low-quality samples or problems in the experimental protocol often cause a loss of data even with traditional sequencing technologies. Here we propose modified estimators of variability and neutrality tests that can be naturally applied to sequences with missing data, without the need to remove bases or individuals from the analysis. The modified statistics include the Watterson estimator θ_W, Tajima's D, Fay and Wu's H, and the HKA test. We develop a general framework to take missing data into account in frequency-spectrum-based neutrality tests, and we derive the exact expression for the variance of these statistics under the neutral model. The neutrality tests proposed here can also be used as summary statistics to describe the information contained in other classes of data, such as DNA microarrays.
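To make the idea of estimating variability without discarding bases or individuals concrete, here is a minimal Python sketch of a Watterson-type estimator that tolerates missing calls. It assumes one common adaptation: each site contributes its own harmonic number a_n, computed from the number of sequences actually observed at that site, and the count of segregating sites is divided by the sum of those terms. This is an illustration under that assumption and not necessarily the exact estimator derived in the paper. The function names and the set of missing-data symbols are hypothetical.

```python
from typing import Sequence

# Assumption: these characters mark a missing base call in the alignment.
MISSING = {"N", "?", "-"}


def harmonic(n: int) -> float:
    """Return a_n = sum_{i=1}^{n-1} 1/i (zero when n < 2)."""
    return sum(1.0 / i for i in range(1, n))


def watterson_missing(alignment: Sequence[str]) -> float:
    """Per-site Watterson-type estimate for equal-length sequences.

    Each column is scored using only the sequences observed at that
    site, so no base or individual is removed from the analysis.
    """
    seg_sites = 0
    denom = 0.0
    for column in zip(*alignment):
        observed = [b.upper() for b in column if b.upper() not in MISSING]
        # Expected number of segregating sites at this column is theta * a_n.
        denom += harmonic(len(observed))
        if len(set(observed)) > 1:
            seg_sites += 1
    if denom == 0.0:
        return float("nan")  # no column has two or more observed bases
    return seg_sites / denom
```

With complete data every column has the same sample size n, and the expression reduces to the usual S / (L · a_n) for an alignment of length L.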
