4.7 Article Proceedings Paper

Analysis of composition-based metagenomic classification

Journal

BMC GENOMICS
Volume 13, Issue -, Pages -

Publisher

BMC
DOI: 10.1186/1471-2164-13-S5-S1

Keywords

-

Background: An essential step of a metagenomic study is taxonomic classification, that is, the identification of the taxonomic lineage of the organisms in a given sample. The taxonomic classification process involves a series of decisions. Currently, in the context of metagenomics, such decisions are usually based on empirical studies that consider one specific type of classifier. In this study we propose a general framework for analyzing the impact that several decisions can have on the classification problem. Instead of focusing on any specific classifier, we define a generic score function that provides a measure of the difficulty of the classification task. Using this framework, we analyze the impact of the following parameters on the taxonomic classification problem: (i) the length of n-mers used to encode the metagenomic sequences, (ii) the similarity measure used to compare sequences, and (iii) the type of taxonomic classification, which can be conventional or hierarchical, depending on whether the classification process occurs in a single shot or in several steps according to the taxonomic tree.

Results: We defined a score function that measures the degree of separability of the taxonomic classes under a given configuration induced by the parameters above. We conducted an extensive computational experiment and found that reasonable values for the parameters of interest could be (i) intermediate values of n, the length of the n-mers; (ii) any similarity measure, because all of them resulted in similar scores; and (iii) the hierarchical strategy, which performed better in all cases.

Conclusions: As expected, short n-mers generate lower configuration scores because they give rise to frequency vectors that represent distinct sequences in a similar way. On the other hand, large values of n result in sparse frequency vectors that give different representations to metagenomic fragments that are in fact similar, also leading to low configuration scores. Regarding the similarity measure, contrary to our expectations, varying the measure did not change the configuration scores significantly. Finally, the hierarchical strategy was more effective than the conventional strategy, which suggests that, instead of using a single classifier, one should adopt multiple classifiers organized as a hierarchy.
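The n-mer encoding discussed in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the restriction to the ACGT alphabet, the choice of cosine similarity as the example measure, and the two toy fragments are all assumptions made for illustration. The sketch builds a relative-frequency vector over all 4^n possible n-mers and reports how similarity and sparsity change with n.

```python
from collections import Counter
from itertools import product
import math

ALPHABET = "ACGT"  # assumption: only unambiguous nucleotides are counted


def nmer_frequency_vector(sequence, n):
    """Relative frequency of each of the 4**n possible n-mers, in lexicographic order."""
    windows = (sequence[i:i + n] for i in range(len(sequence) - n + 1))
    # Skip windows that contain symbols outside the alphabet (e.g. 'N').
    counts = Counter(w for w in windows if set(w) <= set(ALPHABET))
    total = sum(counts.values())
    all_nmers = ("".join(p) for p in product(ALPHABET, repeat=n))
    return [counts[m] / total if total else 0.0 for m in all_nmers]


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def sparsity(vector):
    """Fraction of entries in the vector that are exactly zero."""
    return sum(1 for x in vector if x == 0.0) / len(vector)


if __name__ == "__main__":
    # Hypothetical toy fragments, not taken from the paper.
    fragment_a = "ACGTACGTTGCAACGTGGCATTACGGATCC"
    fragment_b = "TTGACCGTAGGCTAACGTTGCATGCAACGT"
    for n in (1, 3, 6):
        vec_a = nmer_frequency_vector(fragment_a, n)
        vec_b = nmer_frequency_vector(fragment_b, n)
        print(n, round(cosine_similarity(vec_a, vec_b), 3), round(sparsity(vec_a), 3))
```

Because the vector length grows as 4^n while a fragment of length L has at most L - n + 1 windows, most entries are necessarily zero for large n, which is the sparsity effect the conclusions describe.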

Recommended

Article Infectious Diseases

Zika virus disrupts gene expression in human myoblasts and myotubes: Relationship with susceptibility to infection

Ingo Riederer, Daniella Areas Mendes-da-Cruz, Guilherme Cordenonsi da Fonseca, Mariela Natacha Gonzalez, Otavio Brustolini, Cassia Rocha, Guilherme Loss, Joseane Biso de Carvalho, Mariane Talon Menezes, Lidiane Menezes Souza Raphael, Alexandra Gerber, Myrna Cristina Bonaldo, Gillian Butler-Browne, Vincent Mouly, Vinicius Cotta-de-Almeida, Wilson Savino, Ana Tereza Ribeiro de Vasconcelos

Summary: This study investigated the mechanisms of Zika virus infection in human skeletal muscle using an in vitro model. The research found that myoblasts are permissive to ZIKV infection, while myotubes control viral replication. Gene expression profiling revealed differences between infected myoblasts and myotubes, with the latter showing pathways related to antiviral and innate immune responses. This study sheds light on potential antiviral mechanisms against ZIKV infection in skeletal muscle.

PLOS NEGLECTED TROPICAL DISEASES (2022)

Article Microbiology

Seroprevalence, Prevalence, and Genomic Surveillance: Monitoring the Initial Phases of the SARS-CoV-2 Pandemic in Betim, Brazil

Ana Valesca Fernandes Gilson Silva, Diego Menezes, Filipe Romero Rebello Moreira, Octavio Alcantara Torres, Paula Luize Camargos Fonseca, Rennan Garcias Moreira, Hugo Jose Alves, Vivian Ribeiro Alves, Tania Maria de Resende Amaral, Adriano Neves Coelho, Julia Maria Saraiva Duarte, Augusto Viana da Rocha, Luiz Gonzaga Paula de Almeida, Joao Locke Ferreira de Araujo, Hilton Soares de Oliveira, Nova Jersey Claudio de Oliveira, Camila Zolini, Josy Hubner de Sousa, Elizangela Goncalves de Souza, Rafael Marques de Souza, Luciana de Lima Ferreira, Alexandra Lehmkuhl Gerber, Ana Paula de Campos Guimaraes, Paulo Henrique Silva Maia, Fernanda Martins Marim, Lucyene Miguita, Cristiane Campos Monteiro, Tuffi Saliba Neto, Fabricia Soares Freire Pugedo, Daniel Costa Queiroz, Damares Nigia Alborguetti Cuzzuol Queiroz, Luciana Cunha Resende-Moreira, Franciele Martins Santos, Erika Fernanda Carlos Souza, Carolina Moreira Voloch, Ana Tereza Vasconcelos, Renato Santana de Aguiar, Renan Pedra de Souza

Summary: This study conducted epidemiological monitoring using diverse strategies to describe the initial stages of the COVID-19 pandemic in Betim City, Brazil. The results revealed the prevalence of the virus, risk factors for positivity, and multiple viral introductions.

FRONTIERS IN MICROBIOLOGY (2022)

Article Biology

Future-proofing and maximizing the utility of metadata: The PHA4GE SARS-CoV-2 contextual data specification package

Emma J. Griffiths, Ruth E. Timme, Catarina Ines Mendes, Andrew J. Page, Nabil-Fareed Alikhan, Dan Fornika, Finlay Maguire, Josefina Campos, Daniel Park, Idowu B. Olawoye, Paul E. Oluniyi, Dominique Anderson, Alan Christoffels, Anders Goncalves da Silva, Rhiannon Cameron, Damion Dooley, Lee S. Katz, Allison Black, Ilene Karsch-Mizrachi, Tanya Barrett, Anjanette Johnston, Thomas R. Connor, Samuel M. Nicholls, Adam A. Witney, Gregory H. Tyson, Simon H. Tausch, Amogelang R. Raphenya, Brian Alcock, David M. Aanensen, Emma Hodcroft, William W. L. Hsiao, Ana Tereza R. Vasconcelos, Duncan R. MacCannell

Summary: PHA4GE is a global coalition working to improve openness, interoperability, and consistency in public health microbial bioinformatics. They have developed a SARS-CoV-2 contextual data specification package to support data collection and harmonization in public biorepositories.

GIGASCIENCE (2022)

Article Genetics & Heredity

Genomic Perspectives on the Emerging SARS-CoV-2 Omicron Variant

Wentai Ma, Jing Yang, Haoyi Fu, Chao Su, Caixia Yu, Qihui Wang, Ana Tereza Ribeiro de Vasconcelos, Georgii A. Bazykin, Yiming Bao, Mingkun Li

Summary: This study analyzed the viral genome of the Omicron variant and found it to have numerous mutations, especially in the Spike gene. These mutations may affect the replication, infectivity, and antigenicity of SARS-CoV-2. The Omicron variant has 53 mutations compared to its closest sequences in public databases, many of which are rare. Strengthening global genomic surveillance and data sharing is crucial for detecting and tracking the source of new variants.

GENOMICS PROTEOMICS & BIOINFORMATICS (2022)

Article Microbiology

The Importance of Glycerophospholipid Production to the Mutualist Symbiosis of Trypanosomatids

Allan C. de Azevedo-Martins, Kary Ocana, Wanderley de Souza, Ana Tereza Ribeiro de Vasconcelos, Marta M. G. Teixeira, Erney P. Camargo, Joao M. P. Alves, Maria Cristina M. Motta

Summary: The symbiotic relationship between trypanosomatids and bacteria involves extensive metabolic exchanges, with the bacteria providing essential metabolic pathways for the protozoan. An in-silico study found that most genes involved in glycerophospholipid production are only present in the Symbiont Harboring Trypanosomatids (SHTs) and not in the bacteria. The bacterium has specific sequences and genes related to phosphatidylglycerol and phosphatidic acid production, which likely enhance SHT phosphatidic acid production. Phylogenetic analysis suggests that enzymes involved in the glycerophospholipid pathway have eukaryotic characteristics, indicating no gene transfers from the bacterium to the SHT nucleus. Overall, the data indicate that the symbiont plays a limited role in glycerophospholipid production, acquiring most of these molecules from the SHT.

PATHOGENS (2022)

Article Public, Environmental & Occupational Health

Emergence of Within-Host SARS-CoV-2 Recombinant Genome After Coinfection by Gamma and Delta Variants: A Case Report

Ronaldo da Silva Francisco Junior, Luiz G. P. de Almeida, Alessandra P. Lamarca, Liliane Cavalcante, Yasmmin Martins, Alexandra L. Gerber, Ana Paula de C. Guimaraes, Ricardo Barbosa Salviano, Fernanda Leitao dos Santos, Thiago Henrique de Oliveira, Isabelle Vasconcellos de Souza, Erika Martins de Carvalho, Mario Sergio Ribeiro, Silvia Carvalho, Flavio Dias da Silva, Marcio Henrique de Oliveira Garcia, Leandro Magalhaes de Souza, Cristiane Gomes da Silva, Caio Luiz Pereira Ribeiro, Andrea Cony Cavalcanti, Claudia Maria Braga de Mello, Amilcar Tanuri, Ana Tereza R. Vasconcelos

Summary: This study reports the first case of intra-host SARS-CoV-2 recombination between the variants of concern AY.33 (Delta) and P.1 (Gamma) during coinfection. By analyzing sequencing reads that contain lineage-defining mutations from both variants, the researchers identified six recombinant regions in the SARS-CoV-2 genome within a sample, four in the spike gene and two in the nucleocapsid gene. This represents a potential threat to public health management during the COVID-19 pandemic due to the emergence of viruses with recombinant phenotypes.

FRONTIERS IN PUBLIC HEALTH (2022)

Article Pediatrics

Clinical and genetic findings in two siblings with X-Linked agammaglobulinemia and bronchiolitis obliterans: a case report

Ronaldo da Silva Francisco Junior, Guilherme Loss de Morais, Joseane Biso de Carvalho, Cristina dos Santos Ferreira, Alexandra Lehmkuhl Gerber, Ana Paula de C. Guimaraes, Flavia Anisio Amendola, Fernanda Pinto-Mariz, Zilton Farias Meira de Vasconcelos, Ekaterini Simoes Goudouris, Ana Tereza Ribeiro de Vasconcelos

Summary: Our report highlights the importance of whole-exome sequencing (WES) in patients with known inborn errors of immunity, but uncommon clinical presentations. We identified a rare hemizygous missense variant in the BTK gene and a gain-of-function mutation in TGF beta 1, indicating a more complex genetic landscape underlying X-linked agammaglobulinemia (XLA) and bronchiolitis obliterans. This personalized understanding of the genetic basis may have implications for potential treatments and prognosis.

BMC PEDIATRICS (2022)

Article Microbiology

Antimicrobial resistance and genetic background of non-typhoidal Salmonella enterica strains isolated from human infections in Sao Paulo, Brazil (2000-2019)

Aline Parolin Calarga, Marco Tulio Pardini Gontijo, Luiz Gonzaga Paula de Almeida, Ana Tereza Ribeiro de Vasconcelos, Leandro Costa Nascimento, Taise Marongio Cotrim de Moraes Barbosa, Thalita Mara de Carvalho Perri, Silvia Regina dos Santos, Monique Ribeiro Tiba-Casas, Eneida Goncalves Lemes Marques, Cleide Marques Ferreira, Marcelo Brocchi

Summary: This study characterizes the antimicrobial-resistant phenotype of non-typhoidal S. enterica strains isolated from human infections in Sao Paulo, Brazil over a 20-year period. The findings reveal that a small percentage of isolates show multidrug resistance and pathogenic phenotypes, and contain antimicrobial resistance genes that could potentially be disseminated among other bacterial strains.

BRAZILIAN JOURNAL OF MICROBIOLOGY (2022)

Article Biotechnology & Applied Microbiology

Genomic analyses of ciprofloxacin-resistant Neisseria gonorrhoeae isolates recovered from the largest South American metropolitan area

Dandara Cassu-Corsi, Fernanda F. Santos, Rodrigo Cayo, Willames M. B. S. Martins, Carolina S. Nodari, Luiz G. P. Almeida, Rafael A. Martins, Roberto J. Carvalho da Silva, Ana Tereza R. Vasconcelos, Antonio C. C. Pignatari, Ana C. Gales

Summary: This study sequenced Neisseria gonorrhoeae isolates collected in Sao Paulo, Brazil and identified different sequence types and resistance mutations. The results contribute to the understanding of N. gonorrhoeae strains circulating in the region.

GENOMICS (2022)

Article Infectious Diseases

Kinetics Analysis of β-Lactams Hydrolysis by OXA-50 Variants of Pseudomonas aeruginosa

Ana Paula Streling, Rodrigo Cayo, Carolina S. Nodari, Luiz G. P. Almeida, Felipe Bronze, Andre Valencio Siqueira, Adriana P. Matos, Vitor Oliveira, Ana Tereza R. Vasconcelos, Marcelo F. M. Marcondes, Ana Cristina Gales

Summary: Pseudomonas aeruginosa is often associated with life-threatening infections due to its intrinsic and acquired antimicrobial mechanisms, including different types of beta-lactamases. The study found that OXA-488 has a higher catalytic efficiency for benzylpenicillin and imipenem compared to OXA-50, although its carbapenemase activity is considered weak. Additionally, OXA-488 and OXA-494 have increased affinity for penicillins, contributing to improved catalytic efficiency against ampicillin.

MICROBIAL DRUG RESISTANCE (2022)

Article Virology

The Role of Lebanon in the COVID-19 Butterfly Effect: The B.1.398 Example

Dalal Nour, Rayane Rafei, Alessandra P. Lamarca, Luiz G. P. de Almeida, Marwan Osman, Mohamad Bachar Ismail, Hassan Mallat, Atika Berry, Gwendolyne Burfin, Quentin Semanas, Laurence Josset, Hamad Hassan, Fouad Dabboussi, Bruno Lina, Philippe Colson, Ana Tereza R. Vasconcelos, Monzer Hamze

Summary: This study provides a retrospective genomic surveillance of the SARS-CoV-2 pandemic in Lebanon, revealing the dominance of four different lineages and the role of Lebanon as a dispersal center for lineage B.1.398. The district of Tripoli in Lebanon was identified as a significant source of dispersal within the country. These findings highlight the potential role of developing countries in the emergence of new variants.

VIRUSES-BASEL (2022)

Article Multidisciplinary Sciences

Inference of differentially expressed genes using generalized linear mixed models in a pairwise fashion

Douglas Terra Machado, Otavio Jose Bernardes Brustolini, Yasmmin Cortes Martins, Marco Antonio Grivet Mattoso Maia, Ana Tereza Ribeiro de Vasconcelos

Summary: DEGRE is a user-friendly tool that considers fixed and random effects on individuals in the experimental design of RNA-Seq research to infer differentially expressed genes (DEGs). It preprocesses the data and applies generalized linear mixed models (GLMMs) to provide inference for DEGs. This tool efficiently removes genes that could impact the inference and offers improved assessment measures in cases with higher biological variability.

Article Biology

Multi-layered transcriptomic analysis reveals a pivotal role of FMR1 and other developmental genes in Alzheimer's disease-associated brain ceRNA network

Rafael Mina Piergiorge, Ronaldo da Silva Francisco Junior, Ana Tereza Ribeiro de Vasconcelos, Cintia Barros Santos-Reboucas

Summary: This study identified a complex ceRNA network involved in late-onset AD and highlighted the Fragile X Messenger Ribonucleoprotein 1 (FMR1) as a driver gene in this network. The findings enhance our understanding of ceRNA regulatory pathways in AD and provide potential targets for early biomarkers and therapeutic interventions.

COMPUTERS IN BIOLOGY AND MEDICINE (2023)

Article Biochemistry & Molecular Biology

Differential Type-I Interferon Response in Buffy Coat Transcriptome of Individuals Infected with SARS-CoV-2 Gamma and Delta Variants

Guilherme C. da Fonseca, Liliane T. F. Cavalcante, Otavio J. Brustolini, Paula M. Luz, Debora C. Pires, Emilia M. Jalil, Eduardo M. Peixoto, Beatriz Grinsztejn, Valdilea G. Veloso, Sandro Nazer, Carlos A. M. Costa, Daniel A. M. Villela, Guilherme T. Goedert, Cleber V. B. D. Santos, Nadia C. P. Rodrigues, Fernando do Couto Motta, Marilda Mendonca Siqueira, Lara E. Coelho, Claudio J. Struchiner, Ana Tereza R. Vasconcelos

Summary: This article investigates the impact of the Gamma and Delta variants of SARS-CoV-2 on the host immune system, finding that Delta can more effectively activate the IFN-I response, while Gamma evades the host immune system by inhibiting the interferon response pathway.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2023)

Letter Virology

Genomic Surveillance Tracks the First Community Outbreak of the SARS-CoV-2 Delta (B.1.617.2) Variant in Brazil

Alessandra P. Lamarca, Luiz G. P. de Almeida, Ronaldo da Silva Francisco, Liliane Cavalcante, Douglas Terra Machado, Otavio Brustolini, Alexandra L. Gerber, Ana Paula de C. Guimaraes, Cintia Policarpo, Gleidson da Silva de Oliveira, Lidia Theodoro Boullosa, Isabelle Vasconcellos de Souza, Erika Martins de Carvalho, Mario Sergio Ribeiro, Silvia Carvalho, Flavio Dias da Silva, Marcio Henrique de Oliveira Garcia, Leandro Magalhaes de Souza, Cristiane Gomes Da Silva, Caio Luiz Pereira Ribeiro, Andrea Cony Cavalcanti, Claudia Maria Braga de Mello, Amilcar Tanuri, Ana Tereza R. Vasconcelos

JOURNAL OF VIROLOGY (2022)
