Article

Large-scale motif discovery using DNA Gray code and equiprobable oligomers

Journal

BIOINFORMATICS
Volume 28, Issue 1, Pages 25-31

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btr606

Funding

  1. Japan Society for the Promotion of Science (JSPS) through the Council for Science and Technology Policy (CSTP)
  2. Ministry of Education, Culture, Sports, Science and Technology of Japan [20651053, 221S0002, 22310124]
  3. Grants-in-Aid for Scientific Research [221S0002, 22240032, 20651053, 22310124] Funding Source: KAKEN

Motivation: Finding motifs in genome-scale sets of functional sequences, such as all the promoters in a genome, is a challenging problem. Word-based methods count the occurrences of oligomers to detect overrepresented ones. This approach is known to be fast and accurate compared with other methods. However, two problems have hampered the application of such methods to large-scale data. One is the computational cost of clustering similar oligomers, and the other is the bias in the frequency of fixed-length oligomers, which complicates the detection of significant words.

Results: We introduce a method that uses a DNA Gray code and equiprobable oligomers, which solve the clustering problem and the oligomer bias, respectively. Our method can analyze 18 000 sequences of ~1 kbp each in 30 s. We also show that the accuracy of our method is superior to that of a leading method, especially for large-scale data and small fractions of motif-containing sequences.
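As a rough illustration of the two ingredients named in the abstract, the sketch below counts fixed-length oligomers and generates a reflected quaternary Gray code over A, C, G and T, in which consecutive words differ at exactly one position. This is a generic textbook construction, not necessarily the DNA Gray code defined in the paper, and it does not cover equiprobable oligomers; the function names `count_oligomers` and `dna_gray_code` are hypothetical.

```python
from collections import Counter

BASES = "ACGT"


def count_oligomers(sequences, k):
    """Count every length-k word (oligomer) across the sequences.

    Windows containing characters other than A, C, G, T are skipped.
    """
    counts = Counter()
    valid = set(BASES)
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            word = seq[i:i + k]
            if set(word) <= valid:
                counts[word] += 1
    return counts


def dna_gray_code(k):
    """Return all 4**k words of length k in reflected Gray-code order.

    Assumption: a standard reflected base-4 Gray code, so neighbouring
    words differ by a single substitution. The paper's own ordering
    may be defined differently.
    """
    words = [""]
    for _ in range(k):
        extended = []
        for i, base in enumerate(BASES):
            # Reverse every other block so block boundaries also
            # change only the leading base.
            block = words if i % 2 == 0 else words[::-1]
            extended.extend(base + w for w in block)
        words = extended
    return words


if __name__ == "__main__":
    print(count_oligomers(["ACGTACGT"], 4).most_common(1))
    print(dna_gray_code(2))
```

Ordering words this way places oligomers that differ by one substitution next to each other, which hints at why a Gray code could help group similar oligomers without an all-against-all comparison.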
