4.8 Article

Evolutionary superscaffolding and chromosome anchoring to improve Anopheles genome assemblies

期刊

BMC BIOLOGY
卷 18, 期 1, 页码 -

出版社

BMC
DOI: 10.1186/s12915-019-0728-3

关键词

Genome assembly; Gene synteny; Comparative genomics; Mosquito genomes; Orthology; Bioinformatics; Computational evolutionary biology; Chromosomes; Physical mapping

类别

资金

  1. US National Institutes of Health (NIH) National Institute of Allergy and Infectious Diseases (NIAID) [R21 AI112734]
  2. US NIH NIAID [R21AI099528, R21AI135298]
  3. US Department of Agriculture National Institute of Food and Agriculture Hatch project [223822]
  4. US National Science Foundation (NSF) [IIS-1462107]
  5. US NSF [CCF-1053753, DBI-1350041, DEB-1249633]
  6. US NIH [U24CA211000, R01-HG006677]
  7. French Agence Nationale pour la Recherche Ancestrome [ANR-10-BINF-01-01]
  8. Intramural Research Program of the NIH National Human Genome Research Institute [1ZIAHG200398]
  9. Mitacs Globalink grant
  10. Natural Sciences and Engineering Research Council of Canada [RGPIN-249834]
  11. Novartis Foundation [18B116]
  12. Swiss National Science Foundation [PP00P3_170664]
  13. NATIONAL HUMAN GENOME RESEARCH INSTITUTE [ZIAHG200398] Funding Source: NIH RePORTER

向作者/读者索取更多资源

Background New sequencing technologies have lowered financial barriers to whole genome sequencing, but resulting assemblies are often fragmented and far from 'finished'. Updating multi-scaffold drafts to chromosome-level status can be achieved through experimental mapping or re-sequencing efforts. Avoiding the costs associated with such approaches, comparative genomic analysis of gene order conservation (synteny) to predict scaffold neighbours (adjacencies) offers a potentially useful complementary method for improving draft assemblies. Results We evaluated and employed 3 gene synteny-based methods applied to 21 Anopheles mosquito assemblies to produce consensus sets of scaffold adjacencies. For subsets of the assemblies, we integrated these with additional supporting data to confirm and complement the synteny-based adjacencies: 6 with physical mapping data that anchor scaffolds to chromosome locations, 13 with paired-end RNA sequencing (RNAseq) data, and 3 with new assemblies based on re-scaffolding or long-read data. Our combined analyses produced 20 new superscaffolded assemblies with improved contiguities: 7 for which assignments of non-anchored scaffolds to chromosome arms span more than 75% of the assemblies, and a further 7 with chromosome anchoring including an 88% anchored Anopheles arabiensis assembly and, respectively, 73% and 84% anchored assemblies with comprehensively updated cytogenetic photomaps for Anopheles funestus and Anopheles stephensi. Conclusions Experimental data from probe mapping, RNAseq, or long-read technologies, where available, all contribute to successful upgrading of draft assemblies. Our evaluations show that gene synteny-based computational methods represent a valuable alternative or complementary approach. Our improved Anopheles reference assemblies highlight the utility of applying comparative genomics approaches to improve community genomic resources.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据