Article

Species abundance information improves sequence taxonomy classification accuracy

Journal

NATURE COMMUNICATIONS
Volume 10, Issue -, Pages -

Publisher

NATURE PORTFOLIO
DOI: 10.1038/s41467-019-12669-6

Keywords

-

Funding

  1. NSF [1565100, 1565057]
  2. NHMRC [APP1085372]
  3. Direct For Biological Sciences, Div Of Biological Infrastructure [1565100] (Funding Source: National Science Foundation)
  4. Direct For Biological Sciences, Div Of Biological Infrastructure [1565057] (Funding Source: National Science Foundation)


Popular naive Bayes taxonomic classifiers for amplicon sequences assume that all species in the reference database are equally likely to be observed. We demonstrate that classification accuracy degrades linearly with the degree to which that assumption is violated, and in practice it is always violated. By incorporating environment-specific taxonomic abundance information, we demonstrate a significant increase in the species-level classification accuracy across common sample types. At the species level, overall average error rates decline from 25% to 14%, which is favourably comparable to the error rates that existing classifiers achieve at the genus level (16%). Our findings indicate that for most practical purposes, the assumption that reference species are equally likely to be observed is untenable. q2-clawback provides a straightforward alternative for samples from common environments.
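The effect of the uniform-prior assumption can be illustrated with a toy naive Bayes classifier over k-mers. This is only a minimal sketch of the general idea, not the q2-clawback implementation: the species names, reference sequences, query, and abundance weights below are all invented for illustration, and the helper functions `kmers`, `train`, and `posterior` are hypothetical. The query falls in a region shared by both references, so the sequence alone cannot separate them and the class prior decides the outcome.

```python
import math
from collections import Counter


def kmers(seq, k=3):
    """Return the overlapping k-mers of a sequence."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def train(references, k=3, alpha=1.0):
    """Estimate smoothed per-taxon k-mer log-probabilities."""
    vocab = sorted({km for seq in references.values() for km in kmers(seq, k)})
    model = {}
    for taxon, seq in references.items():
        counts = Counter(kmers(seq, k))
        # One extra smoothing slot covers k-mers absent from every reference.
        total = sum(counts.values()) + alpha * (len(vocab) + 1)
        log_probs = {km: math.log((counts[km] + alpha) / total) for km in vocab}
        model[taxon] = (log_probs, math.log(alpha / total))
    return model


def posterior(model, query, priors=None, k=3):
    """Posterior over taxa; priors default to uniform (the usual assumption)."""
    taxa = list(model)
    if priors is None:
        priors = {t: 1.0 / len(taxa) for t in taxa}
    log_scores = {}
    for t in taxa:
        log_probs, unseen = model[t]
        log_lik = sum(log_probs.get(km, unseen) for km in kmers(query, k))
        log_scores[t] = math.log(priors[t]) + log_lik
    # Normalise in log space to avoid underflow on long queries.
    peak = max(log_scores.values())
    norm = sum(math.exp(s - peak) for s in log_scores.values())
    return {t: math.exp(s - peak) / norm for t, s in log_scores.items()}


# Hypothetical references that differ only in their final base.
references = {"species_A": "ACGTACGTGGCA", "species_B": "ACGTACGTGGCT"}
query = "ACGTACGT"  # lies entirely in the shared region

model = train(references)
uniform = posterior(model, query)

# Hypothetical environment-specific abundance weights.
weights = {"species_A": 0.2, "species_B": 0.8}
weighted = posterior(model, query, priors=weights)

print("uniform: ", uniform)
print("weighted:", weighted)
```

Under the uniform prior the two species are indistinguishable for this query, whereas the abundance-weighted prior shifts the posterior toward the species that is more common in the assumed environment. Real classifiers work with far larger reference databases and k-mer vocabularies, so this shows only the mechanism, not the magnitude of the reported improvement.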
