4.7 Article

Density-based hierarchical clustering of pyro-sequences on a large scale-the case of fungal ITS1

期刊

BIOINFORMATICS
卷 29, 期 10, 页码 1268-1274

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btt149

关键词

-

资金

  1. Swiss Federal Government through the State Secretariat for Education and Research (SER)
  2. Swiss National Science Foundation [31003A-125145, PMPDP3-129027]
  3. University of Lausanne
  4. Swiss National Science Foundation (SNF) [31003A_125145, PMPDP3_129027] Funding Source: Swiss National Science Foundation (SNF)

向作者/读者索取更多资源

Motivation: Analysis of millions of pyro-sequences is currently playing a crucial role in the advance of environmental microbiology. Taxonomy-independent, i.e. unsupervised, clustering of these sequences is essential for the definition of Operational Taxonomic Units. For this application, reproducibility and robustness should be the most sought after qualities, but have thus far largely been overlooked. Results: More than 1 million hyper-variable internal transcribed spacer 1 (ITS1) sequences of fungal origin have been analyzed. The ITS1 sequences were first properly extracted from 454 reads using generalized profiles. Then, otupipe, cd-hit-454, ESPRIT-Tree and DBC454, a new algorithm presented here, were used to analyze the sequences. A numerical assay was developed to measure the reproducibility and robustness of these algorithms. DBC454 was the most robust, closely followed by ESPRIT-Tree. DBC454 features density-based hierarchical clustering, which complements the other methods by providing insights into the structure of the data.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据