4.1 Article

Predicting bacterial essential genes using only sequence composition information

期刊

GENETICS AND MOLECULAR RESEARCH
卷 13, 期 2, 页码 4564-4572

出版社

FUNPEC-EDITORA
DOI: 10.4238/2014.June.17.8

关键词

Essential gene prediction; Compositional features; Library for support vector machine; Bacteria

资金

  1. program for New Century Excellent Talents in University [NCET-11-0059]
  2. Key Technology Research and Development Program of Sichuan Province [2011FZ0034]
  3. National Natural Science Foundation of China [31071109, 60801058]
  4. China Postdoctoral Science Foundation [201104687, 2013M540705]

向作者/读者索取更多资源

Essential genes are those genes that are needed by organisms at any time and under any conditions. It is very important for us to identify essential genes from bacterial genomes because of their vital role in synthetic biology and biomedical practices. In this paper, we developed a support vector machine (SVM)-based method to predict essential genes of bacterial genomes using only compositional features. These features are all derived from the primary sequences, i.e., nucleotide sequences and protein sequences. After training on the multiple samplings of the labeled (essential or not essential) features using a library for SVM, we obtained an average area under the ROC curve (AUC) of about 0.82 in a 5-fold cross-validation for Escherichia coli and about 0.74 for Mycoplasma pulmonis. We further evaluated the performance of the method proposed using the dataset consisting of 16 bacterial genomes, and an average AUC of 0.76 was achieved. Based on this training dataset, a model for essential gene prediction was established. Another two independent genomes, Shewanella oneidensis RW1 and Salmonella enterica serovar Typhimurium SL1344 were used to evalutate the model. Results showed that the AUC sores were 0.77 and 0.81, respectively. For the convenience of the vast majority of experimental scientists, a web server has been constructed, which is freely available at http://cefg.uestc.edu.cn:9999/egp.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.1
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据