4.7 Article

Deep-Learning Resources for Studying Glycan-Mediated Host-Microbe Interactions

Journal

CELL HOST & MICROBE
Volume 29, Issue 1, Pages 132-+

Publisher

CELL PRESS
DOI: 10.1016/j.chom.2020.10.004

Funding

  1. Predictive BioAnalytics Initiative at the Wyss Institute for Biologically Inspired Engineering


This study illustrates how machine learning and bioinformatics methods can be used to study the role of glycans in host-pathogen interactions, showing the ability to predict glycan immunogenicity, pathogenicity, and molecular mimicry using deep-learning models and glycan alignment methods. This expands our understanding of host-microbe interactions by identifying and studying the glycan motifs involved in immunogenicity, pathogenicity, molecular mimicry, and immune evasion.
Glycans, the most diverse biopolymer, are shaped by evolutionary pressures stemming from host-microbe interactions. Here, we present machine learning and bioinformatics methods to leverage the evolutionary information present in glycans to gain insights into how pathogens and commensals interact with hosts. By using techniques from natural language processing, we develop deep-learning models for glycans that are trained on a curated dataset of 19,299 unique glycans and can be used to study and predict glycan functions. We show that these models can be utilized to predict glycan immunogenicity and the pathogenicity of bacterial strains, as well as investigate glycan-mediated immune evasion via molecular mimicry. We also develop glycan-alignment methods and use these to analyze virulence-determining glycan motifs in the capsular polysaccharides of bacterial pathogens. These resources enable one to identify and study glycan motifs involved in immunogenicity, pathogenicity, molecular mimicry, and immune evasion, expanding our understanding of host-microbe interactions.
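As a rough illustration of the language-like treatment of glycans described above, the sketch below splits glycan strings into tokens and maps them to integer IDs, the kind of input a sequence model could consume. It assumes glycans written in IUPAC-condensed notation; the function names (`tokenize_glycan`, `build_vocab`, `encode`) and the example glycans are hypothetical and are not taken from the paper, whose actual preprocessing may differ.

```python
import re
from typing import Dict, Iterable, List

# Hypothetical tokenizer (not the authors' code): a token is either a
# linkage in parentheses such as "(b1-4)", a branch bracket "[" or "]",
# or a monosaccharide name such as "GlcNAc".
TOKEN_RE = re.compile(r"\([^()]*\)|\[|\]|[^()\[\]]+")


def tokenize_glycan(iupac: str) -> List[str]:
    """Split an IUPAC-condensed glycan string into tokens."""
    return TOKEN_RE.findall(iupac)


def build_vocab(glycans: Iterable[str]) -> Dict[str, int]:
    """Assign an integer ID to every distinct token, in sorted order."""
    tokens = sorted({tok for g in glycans for tok in tokenize_glycan(g)})
    return {tok: idx for idx, tok in enumerate(tokens)}


def encode(glycan: str, vocab: Dict[str, int]) -> List[int]:
    """Convert one glycan into the list of its token IDs."""
    return [vocab[tok] for tok in tokenize_glycan(glycan)]


if __name__ == "__main__":
    # Illustrative inputs only, chosen to show a linear and a branched glycan.
    examples = ["Gal(b1-4)GlcNAc", "Fuc(a1-2)[GalNAc(a1-3)]Gal"]
    vocab = build_vocab(examples)
    for glycan in examples:
        print(glycan, tokenize_glycan(glycan), encode(glycan, vocab))
```

Keeping linkages and branch brackets as separate tokens preserves the bond and branching information in the encoded sequence; a real pipeline would also need to handle tokens unseen during vocabulary construction.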

Recommended

Article Biotechnology & Applied Microbiology

Gene switch for l-glucose-induced biopharmaceutical production in mammalian cells

Tobias Strittmatter, Sabina Egli, Adrian Bertschi, Richard Plieninger, Daniel Bojar, Mingqi Xie, Martin Fussenegger

Summary: In this study, a gene switch using l-glucose and d-idonate to regulate gene expression was designed and successfully utilized to control the production of rituximab.

BIOTECHNOLOGY AND BIOENGINEERING (2021)

Article Biochemistry & Molecular Biology

The Role of Fucose-Containing Glycan Motifs Across Taxonomic Kingdoms

Luc Thomes, Daniel Bojar

Summary: This study examines the distribution and characteristics of fucose-containing glycan motifs across different taxa, revealing clear differences in fucose usage among various groups, even within the same domain, based on the physiology and habitat of organisms. The differences in fucose-containing motifs between vertebrates and invertebrates are highlighted, as well as the importance of fucose-containing motifs in molecular mimicry, as shown with pathogenic and non-pathogenic strains of Escherichia coli. This research sheds light on an important class of glycan motifs and provides new insights into the role of fucosylated glycans in symbiosis, pathogenicity, and immunity.

FRONTIERS IN MOLECULAR BIOSCIENCES (2021)

Article Biochemistry & Molecular Biology

A Useful Guide to Lectin Binding: Machine-Learning Directed Annotation of 57 Unique Lectin Specificities

Daniel Bojar, Lawrence Meche, Guanmin Meng, William Eng, David F. Smith, Richard D. Cummings, Lara K. Mahal

Summary: Glycans play critical roles in biology and medicine, but the specificity of lectins, the key glycan-binding proteins, has not been well defined. In this study, machine learning algorithms and expert annotation were used to define the binding specificity of 57 unique lectins. This research provides important insights into the complex binding features of commercially available lectins.

ACS CHEMICAL BIOLOGY (2022)

Article Chemistry, Multidisciplinary

LectinOracle: A Generalizable Deep Learning Model for Lectin-Glycan Binding Prediction

Jon Lundstrom, Emma Korhonen, Frederique Lisacek, Daniel Bojar

Summary: The LectinOracle model, which combines transformer-based representations for proteins with graph convolutional neural networks for glycans, predicts protein-glycan interactions accurately and generalizes well to new glycans and lectins. Its applications include improving lectin classification, accelerating lectin directed evolution, predicting epidemiological outcomes, and analyzing host-microbe interactions.

ADVANCED SCIENCE (2022)

Article Biochemistry & Molecular Biology

Structural insights into host-microbe glycointeractions

Jon Lundstrom, Daniel Bojar

Summary: Despite their widespread presence in biological systems, glycans have historically received insufficient attention. Recent investigations have revealed the significance of glycans in regulating the human gut microbiota. This article provides a brief overview of current trends in computational and experimental research approaches, enhancing our understanding of host-microbe glycointeractions.

CURRENT OPINION IN STRUCTURAL BIOLOGY (2022)

Review Chemistry, Multidisciplinary

Glycoinformatics in the Artificial Intelligence Era

Daniel Bojar, Frederique Lisacek

Summary: The use of artificial intelligence methods in glycoinformatics is currently limited due to the unique challenges of glyco-data. However, with the accumulation of glycomics and glycan-binding data, as well as advancements in deep learning techniques, the future of glycoinformatics looks promising.

CHEMICAL REVIEWS (2022)

Article Multidisciplinary Sciences

Deep learning explains the biology of branched glycans from single-cell sequencing data

Rui Qin, Lara K. Mahal, Daniel Bojar

Summary: Glycosylation is a common occurrence in cells and is often dysregulated in diseases. Understanding the regulation and functional importance of different types of glycosylation at the cellular level is challenging experimentally. However, the use of multi-omics and single-cell measurements, such as SUGAR-seq, can help address this issue. In this study, a deep learning model was developed to predict glycan phenotypes of cells using transcriptomic data. The model interpretation process identified genes that are highly predictive and relevant to glycan biology. This work demonstrates the potential of interpretable deep learning models in uncovering novel functions and regulatory mechanisms of glycans from integrated transcriptomic and glycomic datasets.

ISCIENCE (2022)

Article Chemistry, Multidisciplinary

GlyLES: Grammar-based Parsing of Glycans from IUPAC-condensed to SMILES

Roman Joeres, Daniel Bojar, Olga V. Kalinina

Summary: Glycans are polysaccharides that play important roles in cellular processes. Their structures can be represented using IUPAC-condensed notation. However, there is a lack of an easy-to-use tool to convert IUPAC-condensed notation to SMILES for atomic-level representation.

JOURNAL OF CHEMINFORMATICS (2023)

Article Cell Biology

Mammalian milk glycomes: Connecting the dots between evolutionary conservation and biosynthetic pathways

Luc Thomes, Viktoria Karlsson, Jon Lundstrom, Daniel Bojar

Summary: Milk oligosaccharides (MOs) are important constituents of breast milk. Their biosynthesis and evolutionary relationships were studied using a comprehensive dataset of >100 mammalian species. The study identified systematic glycome biases, biosynthetic restrictions, and conserved biosynthetic modules, advancing our understanding of glycan biosynthesis and the evolution of breast milk.

CELL REPORTS (2023)

Meeting Abstract Biochemistry & Molecular Biology

The Cup of Life is Not So Shallow: Milky Secrets

Daniel Bojar

GLYCOBIOLOGY (2022)

Meeting Abstract Biochemistry & Molecular Biology

Endless forms most beautiful - Merging machine learning and Glycobiology

Daniel Bojar

GLYCOBIOLOGY (2021)

Article Biochemistry & Molecular Biology

Glycowork: A Python package for glycan data science and machine learning

Luc Thomes, Rebekka Burkholz, Daniel Bojar

Summary: Glycowork is an open-source Python package designed for glycan-related data science and machine learning, providing functions such as automatic annotation of glycan motifs and analysis of their distributions. It also includes visualization methods, routines to interact with stored databases, trained machine learning models, and learned glycan representations. The tool aims to extract further insights from glycan datasets for various biological contexts.

GLYCOBIOLOGY (2021)

Article Social Sciences, Mathematical Methods

DeepConnection: classifying momentary relationship state from images of romantic couples

Maximiliane Uhlich, Daniel Bojar

Summary: By using deep learning methods, facial and bodily emotion expression, and other features, we can assess the momentary relationship state of romantic couples. Our new model, DeepConnection, achieved an average accuracy of nearly 97%, demonstrating its potential for informing couples, advancing relationship research, and assisting in couple therapy.

JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE (2021)
