期刊
BIOINFORMATICS
卷 28, 期 5, 页码 679-686出版社
OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/bts017
关键词
-
类别
资金
- James Burgess Scholarship
- Holbeck Trust
- BBSRC [BB/F003099/1, BB/D525056/1] Funding Source: UKRI
- Biotechnology and Biological Sciences Research Council [BB/D525056/1, BB/F003099/1] Funding Source: researchfish
Motivation: Next-generation sequencing methods are generating increasingly massive datasets, yet still do not fully capture genetic diversity in the richest environments. To understand such complicated and elusive systems, effective tools are needed to assist with delineating the differences found in and between community datasets. Results: The Small Subunit Markov Modeler (SSuMMo) was developed to probabilistically assign SSU rRNA gene fragments from any sequence dataset to recognized taxonomic clades, producing consistent, comparable cladograms. Accuracy tests predicted > 90% of genera correctly for sequences downloaded from public reference databases. Sequences from a next-generation sequence dataset, sampled from lean, overweight and obese individuals, were analysed to demonstrate parallel visualization of comparable datasets. SSuMMo shows potential as a valuable curatorial tool, as numerous incorrect and outdated taxonomic entries and annotations were identified in public databases. Availability and implementation: SSuMMo is GPLv3 open source Python software, available at http://code.google.com/p/ssummo/. Taxonomy and HMM databases can be downloaded from http://bioltfws1.york.ac.uk/ssummo/.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据