期刊
JOURNAL OF MOLECULAR RECOGNITION
卷 32, 期 5, 页码 -出版社
WILEY
DOI: 10.1002/jmr.2770
关键词
bacterial transcription; bioinformatics; clustering technique; promoters; sigma factor
资金
- Universidade de Caxias do Sul (UCS)
Promoters are DNA sequences located upstream of the transcription start site of genes. In bacteria, the RNA polymerase enzyme requires additional subunits, called sigma factors (sigma) to begin specific gene transcription in distinct environmental conditions. Currently, promoter prediction still poses many challenges due to the characteristics of these sequences. In this paper, the nucleotide content of Escherichia coli promoter sequences, related to five alternative sigma factors, was analyzed by a machine learning technique in order to provide profiles according to the sigma factor which recognizes them. For this, the clustering technique was applied since it is a viable method for finding hidden patterns on a data set. As a result, 20 groups of sequences were formed, and, aided by the Weblogo tool, it was possible to determine sequence profiles. These found patterns should be considered for implementing computational prediction tools. In addition, evidence was found of an overlap between the functions of the genes regulated by different sigma factors, suggesting that DNA structural properties are also essential parameters for further studies.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据