4.7 Article

Modeling of the GC content of the substituted bases in bacterial core genomes

期刊

BMC GENOMICS
卷 19, 期 -, 页码 -

出版社

BMC
DOI: 10.1186/s12864-018-4984-3

关键词

Microbial genomics; Core genome; Mathematical modeling; SNP GC content; Core genome GC content; Statistical parameter estimation

资金

  1. Norwegian Institute of Public Health

向作者/读者索取更多资源

Background: The purpose of the present study was to examine the GC content of substituted bases (sbGC) in the core genomes of 35 bacterial species. Each species, or core genome, constituted genomes from at least 10 strains. We also wanted to explore whether sbGC for each strain was associated with the corresponding species' core genome GC content (cgGC). We present a simple mathematical model that estimates sbGC from cgGC. The model assumes only that the estimated sbGC is a function of cgGC proportional to fixed AT -> GC (alpha) and GC -> AT (beta) mutation rates. Non-linear regression was used to estimate parameters alpha and beta from the empirical data described above. Results: We found that sbGC for each strain showed a non-linear association with the corresponding cgGC with a bias towards higher GC content for most core genomes (66.3% of the strains), assuming as a null-hypothesis that sbGC should be approximately equal to cgGC. The most GC rich core genomes (i.e. approximately %GC > 60), on the other hand, exhibited slightly less GC-biased sbGC than expected. The best fitted regression model indicates that GC -> AT mutation rates beta = (1.91 +/- 0.13) p <0.001 are approximately (1.91/0.79) = 2.42 times as high, on average, as AT -> GC alpha = (-0.79 +/- 0.25) p <0.001 mutation rates. Whether the observed sbGC GC-bias for all but the most GC-rich prokaryotic species is due to selection, compensating for the GC -> AT mutation bias, and/or selective neutral processes is currently debated. Residual standard error was found to be sigma = 0.076 indicating estimated errors of sbGC to be approximately within +/- 15.2% GC (95% confidence interval) for the strains of all species in the study. Conclusion: Not only did our mathematical model give reasonable estimates of sbGC it also provides further support to previous observations that mutation rates in prokaryotes exhibit a universal GC -> AT bias that appears to be remarkably consistent between taxa.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据