4.6 Article

GenSeed-HMM: A Tool for Progressive Assembly Using Profile HMMs as Seeds and its Application in Alpavirinae Viral Discovery from Metagenomic Data

期刊

FRONTIERS IN MICROBIOLOGY
卷 7, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fmicb.2016.00269

关键词

Aipavirinae; sequence assembly; metagenomic analysis; viral discovery; de novo diagnosis

资金

  1. National Council for Scientific and Technological Development (CNPq)
  2. PIBIC/CNPq
  3. CAPES
  4. Sao Paulo Research Foundation (FAPESP) [2013/14622-3]
  5. PAPA internal funding at Universidad de los Andes
  6. Colciencias
  7. Sao Paulo Research Foundation - FAPESP [2010/04609-1]
  8. School of Sciences at Universidad de los Andes

向作者/读者索取更多资源

This work reports the development of GenSeed-HMM, a program that implements seed-driven progressive assembly, an approach to reconstruct specific sequences from unassembled data, starting from short nucleotide or protein seed sequences or profile Hidden Markov Models (HMM). The program can use any one of a number of sequence assemblers. Assembly is performed in multiple steps and relatively few reads are used in each cycle, consequently the program demands low computational resources. As a proof-of-concept and to demonstrate the power of HMM-driven progressive assemblies, GenSeed-HMM was applied to metagenomic datasets in the search for diverse ssDNA bacteriophages from the recently described Alpavirinae subfamily. Profile HMMs were built using Alpavinnae-specific regions from multiple sequence alignments (MSA) using either the viral protein 1 (VP1; major capsid protein) or VP4 (genome replication initiation protein). These profile HMMs were used by GenSeed-HMM (running Newbler assembler) as seeds to reconstruct viral genomes from sequencing datasets of human fecal samples. All contigs obtained were annotated and taxonomically classified using similarity searches and phylogenetic analyses. The most specific profile HMM seed enabled the reconstruction of 45 partial or complete Alpavinnae genomic sequences. A comparison with conventional (global) assembly of the same original dataset, using Newbler in a standalone execution, revealed that GenSeed-HMM outperformed global genomic assembly in several metrics employed. This approach is capable of detecting organisms that have not been used in the construction of the profile HMM, which opens up the possibility of diagnosing novel viruses, without previous specific information, constituting a de novo diagnosis. Additional applications include, but are not limited to, the specific assembly of extrachromosomal elements such as plastid and mitochondrial genomes from metagenomic data. Profile HMM seeds can also be used to reconstruct specific protein coding genes for gene diversity studies, and to determine all possible gene variants present in a metagenomic sample. Such surveys could be useful to detect the emergence of drug-resistance variants in sensitive environments such as hospitals and animal production facilities, where antibiotics are regularly used. Finally, GenSeed-HMM can be used as an adjunct for gap closure on assembly finishing projects, by using multiple contig ends as anchored seeds.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Food Science & Technology

Comparing novel shotgun DNA sequencing and state-of-the-art proteomics approaches for authentication of fish species in mixed samples

Madhushri S. Varunjikar, Carlos Moreno-Ibarguen, Juan S. Andrade-Martinez, Hui-Shan Tung, Ikram Belghit, Magnus Palmblad, Pal A. Olsvik, Alejandro Reyes, Josef D. Rasinger, Kai K. Lie

Summary: The study shows that both DNA and protein-based approaches can efficiently tackle current challenges in feed and food authentication analyses.

FOOD CONTROL (2022)

Article Biochemistry & Molecular Biology

Novel Insights on Obligate Symbiont Lifestyle and Adaptation to Chemosynthetic Environment as Revealed by the Giant Tubeworm Genome

Andre Luiz de Oliveira, Jessica Mitchell, Peter Girguis, Monika Bright

Summary: The study presents the high-quality draft genome of the giant tubeworm Riftia pachyptila, revealing signs of reductive evolution and evolutionary adaptations to the vent environment and endosymbiosis. The conservation of developmental gene repertoire in the gutless tubeworm and the role of innate immune system in establishing symbiosis are highlighted. The research bridges four decades of physiological research in Riftia and sheds new light on development, whole organism functions, and evolution in the giant tubeworm.

MOLECULAR BIOLOGY AND EVOLUTION (2022)

Article Microbiology

The Importance of Glycerophospholipid Production to the Mutualist Symbiosis of Trypanosomatids

Allan C. de Azevedo-Martins, Kary Ocana, Wanderley de Souza, Ana Tereza Ribeiro de Vasconcelos, Marta M. G. Teixeira, Erney P. Camargo, Joao M. P. Alves, Maria Cristina M. Motta

Summary: The symbiotic relationship between trypanosomatids and bacteria involves extensive metabolic exchanges, with the bacteria providing essential metabolic pathways for the protozoan. An in-silico study found that most genes involved in glycerophospholipid production are only present in the Symbiont Harboring Trypanosomatids (SHTs) and not in the bacteria. The bacterium has specific sequences and genes related to phosphatidylglycerol and phosphatidic acid production, which likely enhance SHT phosphatidic acid production. Phylogenetic analysis suggests that enzymes involved in the glycerophospholipid pathway have eukaryotic characteristics, indicating no gene transfers from the bacterium to the SHT nucleus. Overall, the data indicate that the symbiont plays a limited role in glycerophospholipid production, acquiring most of these molecules from the SHT.

PATHOGENS (2022)

Review Microbiology

Computational Tools for the Analysis of Uncultivated Phage Genomes

Juan Sebastian Andrade-Martinez, Laura Carolina Camelo Valera, Luis Alberto Chica Cardenas, Laura Forero-Junco, Gamaliel Lopez-Leal, J. Leonardo Moreno-Gallego, Guillermo Rangel-Pineros, Alejandro Reyes

Summary: Over a century of bacteriophage research has uncovered fundamental aspects of their biology, ecology, and evolution. The introduction of community-level studies through metagenomics has revealed unprecedented insights on the impact that phages have on ecological and physiological processes. The availability of computational tools has greatly contributed to our knowledge of phage diversity and ecology, but the ongoing surge in software programs makes it challenging to keep up with them.

MICROBIOLOGY AND MOLECULAR BIOLOGY REVIEWS (2022)

Article Biochemistry & Molecular Biology

A Phakopsora pachyrhizi Effector Suppresses PAMP-Triggered Immunity and Interacts with a Soybean Glucan Endo-1,3-β-Glucosidase to Promote Virulence

Thays Bueno, Patricia P. Fontes, Valeria Y. Abe, Alice Satiko Utiyama, Renato L. Senra, Liliane S. Oliveira, Adriana Brombini dos Santos, Everton G. Capote Ferreira, Luana M. Darben, Aluizio Borem de Oliveira, Ricardo Abdelnoor, Steven A. Whitham, Luciano G. Fietto, Francismar C. Marcelino-Guimaraes

Summary: This study investigates the pathogenic mechanisms of Asian soybean rust, focusing on the interaction between the effector protein Phapa7431740 and a soybean protein Gm beta GLU. The results show that Phapa-7431740 suppresses host immune response and interacts with Gm beta GLU. The findings suggest that Phapa-7431740 may inhibit immune response by interfering with the activity of glucan endo-1,3-beta-glucosidase.

MOLECULAR PLANT-MICROBE INTERACTIONS (2022)

Article Biochemistry & Molecular Biology

Molecular dynamics simulations of the SARS-CoV-2 Spike protein and variants of concern: structural evidence for convergent adaptive evolution

Daniel Ferreira de Lima Neto, Vagner Fonseca, Ronaldo Jesus, Leonardo Hermes Dutra, Layssa Miranda de Oliveria Portela, Carla Freitas, Eduardo Fillizola, Breno Soares, Andre Luiz de Abreu, Sandeep Twiari, Vasco Azevedo, Aristoteles Goes-Neto, Arnaldo Correia de Medeiros, Norberto Peporine Lopes, Paolo Marinho de Andrade Zanotto, Rodrigo Bentes Kato

Summary: This study used the structure of SARS-CoV-2 spike protein to examine the impact of mutations on the protein's stability and interaction with ACE-2 receptor. Molecular dynamics simulations and protein-protein docking experiments revealed that the mutations affected the stability of the protein and improved its interaction with ACE-2 receptor, particularly in the Gamma variant.

JOURNAL OF BIOMOLECULAR STRUCTURE & DYNAMICS (2023)

Article Biochemistry & Molecular Biology

The complete and closed genome of the facultative generalist Candidatus Endoriftia persephone from deep-sea hydrothermal vents

Andre Luiz De Oliveira, Abhishek Srivastava, Salvador Espada-Hinojosa, Monika Bright

Summary: This study presents the closed chromosomal sequence of the endosymbiont Endoriftia using single-molecule real-time sequencing. The genome of Endoriftia is smaller than expected and shows versatility in sulfur metabolism. The presence of rRNA operons, CRISPR regions, and different secretion systems suggests lineage-specific adaptations. The study also highlights the importance of molecular memory-based immunity against phages in Endoriftia.

MOLECULAR ECOLOGY RESOURCES (2022)

Article Virology

Rational Design of Profile HMMs for Sensitive and Specific Sequence Detection with Case Studies Applied to Viruses, Bacteriophages, and Casposons

Liliane S. S. Oliveira, Alejandro Reyes, Bas E. E. Dutilh, Arthur Gruber

Summary: The study developed protocols for the rational design of profile HMMs, which can automatically identify informative sequence motifs and construct profile HMMs. These methods were applied to detect and classify different viral groups and related transposable elements.

VIRUSES-BASEL (2023)

Article Microbiology

Genomic and Evolutionary Features of Nine AHPND Positive Vibrio parahaemolyticus Strains Isolated from South American Shrimp Farms

Alejandro Castellanos, Leda Restrepo, Leandro Bajana, Irma Betancourt, Bonny Bayot, Alejandro Reyes

Summary: AHPND is a disease causing significant losses in the shrimp farming industry, with global losses exceeding $2.6 billion. The most common etiological agent is V. parahaemolyticus strains carrying the PirAB(vp) toxin. By analyzing South American AHPND-causing V. parahaemolyticus isolates at the genomic level, it was found that they have high similarity but do not cluster with other Mexican strains, suggesting different genetic backgrounds and possible acquisition of the pVA1-type plasmid through horizontal gene transfer at different times.

MICROBIOLOGY SPECTRUM (2023)

Article Infectious Diseases

Assessment of the Risk of Exotic Zika Virus Strain Transmission by Aedes aegypti and Culex quinquefasciatus from Senegal Compared to a Native Strain

Alioune Gaye, Cheikh Fall, Oumar Faye, Myrielle Dupont-Rouzeyrol, El Hadji Ndiaye, Diawo Diallo, Paolo Marinho de Andrade Zanotto, Ibrahima Dia, Scott C. Weaver, Mawlouth Diallo

Summary: This study assessed the susceptibility of A. aegypti and C. quinquefasciatus to ZIKV strains from Senegal, Brazil, and New Caledonia, and found that the Senegalese strain had a significantly higher infection rate compared to the Brazilian and New Caledonian strains. No infection was recorded for C. quinquefasciatus.

TROPICAL MEDICINE AND INFECTIOUS DISEASE (2023)

Letter Biotechnology & Applied Microbiology

Guidelines for public database submission of uncultivated virus genome sequences for taxonomic classification

Evelien M. M. Adriaenssens, Simon Roux, J. Rodney Brister, Ilene Karsch-Mizrachi, Jens H. H. Kuhn, Arvind Varsani, Tong Yigang, Alejandro Reyes, Cedric Lood, Elliot J. J. Lefkowitz, Matthew B. B. Sullivan, Robert A. A. Edwards, Peter Simmonds, Luisa Rubino, Sead Sabanadzovic, Mart Krupovic, Bas E. E. Dutilh

NATURE BIOTECHNOLOGY (2023)

Correction Biotechnology & Applied Microbiology

Guidelines for public database submission of uncultivated virus genome sequences for taxonomic classification (vol 41, pg 898, 2023)

Evelien M. M. Adriaenssens, Simon Roux, J. Rodney Brister, Ilene Karsch-Mizrachi, Jens H. Kuhn, Arvind Varsani, Tong Yigang, Alejandro Reyes, Cedric Lood, Elliot J. Lefkowitz, Matthew B. B. Sullivan, Robert A. A. Edwards, Peter Simmonds, Luisa Rubino, Sead Sabanadzovic, Mart Krupovic, Bas E. E. Dutilh

NATURE BIOTECHNOLOGY (2023)

Article Biochemical Research Methods

VIRify: An integrated detection, annotation and taxonomic classification pipeline using virus-specific protein profile hidden Markov models

Guillermo Rangel-Pineros, Alexandre Almeida, Martin Beracochea, Ekaterina Sakharova, Manja Marz, Alejandro Reyes Munoz, Martin Hoelzer, Robert D. Finn

Summary: VIRify is a computational pipeline that accurately characterizes the functional and taxonomic aspects of viral communities based on sequencing data. It utilizes viral profile hidden Markov models to identify and classify viral contigs, providing taxonomic classifications at different ranks.

PLOS COMPUTATIONAL BIOLOGY (2023)

Article Microbiology

Complete genome sequence of the archetype bile acid 7a-dehydroxylating bacterium, Clostridium scindens VPI12708, isolated from human feces, circa 1980

Kelly Yovani Olivos Caicedo, Francelys V. Fernandez-Materan, Alvaro G. Hernandez, Steven L. Daniel, Joao M. P. Alves, Jason M. Ridlon

Summary: Clostridium scindens strain VPI12708 is used as a model organism to study bile acid 7a-dehydroxylating pathways. The closed circular genome of C. scindens VPI12708, with 3,983,052 bp and 47.59% G + C, was obtained by PacBio sequencing. A total of 3,707 coding DNA sequences are predicted in the genome.

MICROBIOLOGY RESOURCE ANNOUNCEMENTS (2023)

Article Biochemistry & Molecular Biology

Comparative genomics of a vertically transmitted thiotrophic bacterial ectosymbiont and its close free-living relative

Salvador Espada-Hinojosa, Clarissa Karthauser, Abhishek Srivastava, Lukas Schuster, Teresa Winter, Andre Luiz de Oliveira, Frederik Schulz, Matthias Horn, Stefan Sievert, Monika Bright

Summary: This study reveals the impact of strict host dependence on genome evolution and host adaptation of a bacterial ectosymbiont. Thiobius has a smaller genome, reduced metabolic capabilities, and fewer functional traits compared to its free-living relative ODIII6. The differences in functional capabilities at the gene, metabolic pathway, and trait levels between Thiobius and ODIII6 illustrate adaptations to different environmental conditions.

MOLECULAR ECOLOGY RESOURCES (2023)

暂无数据