4.7 Review

Bioinformatics Mining and Modeling Methods for the Identification of Disease Mechanisms in Neurodegenerative Disorders

期刊

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES
卷 16, 期 12, 页码 29179-29206

出版社

MDPI
DOI: 10.3390/ijms161226148

关键词

mechanism-identification; bioinformatics; genetics; graphical models; knowledge-based modeling; multiscale; neurodegeneration; data integration; disease models

资金

  1. EU/EFPIA Innovative Medicines Initiative Joint Undertaking AETIONOMY [115568]

向作者/读者索取更多资源

Since the decoding of the Human Genome, techniques from bioinformatics, statistics, and machine learning have been instrumental in uncovering patterns in increasing amounts and types of different data produced by technical profiling technologies applied to clinical samples, animal models, and cellular systems. Yet, progress on unravelling biological mechanisms, causally driving diseases, has been limited, in part due to the inherent complexity of biological systems. Whereas we have witnessed progress in the areas of cancer, cardiovascular and metabolic diseases, the area of neurodegenerative diseases has proved to be very challenging. This is in part because the aetiology of neurodegenerative diseases such as Alzheimer ' s disease or Parkinson ' s disease is unknown, rendering it very difficult to discern early causal events. Here we describe a panel of bioinformatics and modeling approaches that have recently been developed to identify candidate mechanisms of neurodegenerative diseases based on publicly available data and knowledge. We identify two complementary strategiesdata mining techniques using genetic data as a starting point to be further enriched using other data-types, or alternatively to encode prior knowledge about disease mechanisms in a model based framework supporting reasoning and enrichment analysis. Our review illustrates the challenges entailed in integrating heterogeneous, multiscale and multimodal information in the area of neurology in general and neurodegeneration in particular. We conclude, that progress would be accelerated by increasing efforts on performing systematic collection of multiple data-types over time from each individual suffering from neurodegenerative disease. The work presented here has been driven by project AETIONOMY; a project funded in the course of the Innovative Medicines Initiative (IMI); which is a public-private partnership of the European Federation of Pharmaceutical Industry Associations (EFPIA) and the European Commission (EC).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Mathematical & Computational Biology

Deep neural network prediction of genome-wide transcriptome signatures - beyond the Black-box

Rasmus Magnusson, Jesper N. Tegner, Mika Gustafsson

Summary: Prediction algorithms for protein or gene structures, including transcription factor binding, have been informative in understanding gene regulation. The study found that the expression of 1600 TFs could explain over 95% of the variation in 25,000 genes, with a smaller core set of around 125 TFs explaining close to 80% of the variance. Reducing the number of TFs below 500 led to a rapid decline in prediction performance. By evaluating the prediction model using transcriptional data from 22 human diseases, the study demonstrated the ability of TFs to predict dysregulation of target genes, with key causative TFs identified for subsequent validation using disease-associated genetic variants.

NPJ SYSTEMS BIOLOGY AND APPLICATIONS (2022)

Review Biochemical Research Methods

On the influence of several factors on pathway enrichment analysis

Sarah Mubeen, Alpha Tom Kodamullil, Martin Hofmann-Apitius, Daniel Domingo-Fernandez

Summary: Pathway enrichment analysis is widely used for interpreting biomedical data, but its results are influenced by various factors. Benchmark studies have evaluated these factors and identified key influences on the analysis results.

BRIEFINGS IN BIOINFORMATICS (2022)

Editorial Material Cell Biology

Data-driven bioinformatics to disentangle cells within a tissue microenvironment

Jesper N. Tegner, David Gomez-Cabrero

Summary: Molecular profiling of clinical tissue samples is vital in precision medicine, but understanding the contribution of mixed cell types and detecting changes in cell populations due to infections or drugs is difficult. Recent advances in machine learning offer the promise of learning explanatory models directly from data.

TRENDS IN CELL BIOLOGY (2022)

Article Multidisciplinary Sciences

Deconvolution of the hematopoietic stem cell microenvironment reveals a high degree of specialization and conservation

Jin Ye, Isabel A. Calvo, Itziar Cenzano, Amaia Vilas, Xabier Martinez-de-Morentin, Miren Lasaga, Diego Alignani, Bruno Paiva, Ana C. Vinado, Patxi San Martin-Uriz, Juan P. Romero, Delia Quilez Agreda, Marta Minana Barrios, Ignacio Sancho-Gonzalez, Gabriele Todisco, Luca Malcovati, Nuria Planell, Borja Saez, Jesper N. Tegner, Felipe Prosper, David Gomez-Cabrero

Summary: This study developed a tailored bioinformatic pipeline to integrate public and proprietary single-cell RNA sequencing datasets, providing a comprehensive description of the regulatory microenvironment of HSCs in mice and suggesting conserved features in humans.

ISCIENCE (2022)

Article Biochemical Research Methods

Elucidating gene expression patterns across multiple biological contexts through a large-scale investigation of transcriptomic datasets

Rebeca Queiroz Figueiredo, Sara Diaz del Ser, Tamara Raschka, Martin Hofmann-Apitius, Alpha Tom Kodamullil, Sarah Mubeen, Daniel Domingo-Fernandez

Summary: Distinct gene expression patterns within various contexts were explored in this study to reveal specific functional roles and biological processes. By utilizing co-expression networks, unique and common patterns were identified, and pathway-level analysis further investigated the relationship between gene expression and specific contexts. The data and tools from this study are important for a deeper understanding of the relationship between gene expression and biological processes.

BMC BIOINFORMATICS (2022)

Article Biochemical Research Methods

PEMT: a patent enrichment tool for drug discovery

Yojana Gadiya, Andrea Zaliani, Philip Gribbon, Martin Hofmann-Apitius

Summary: The study develops a patent enrichment tool called PEMT, which extracts relevant patent information linked to chemical structures and gene names. It supports drug discovery and research by establishing a patent landscape around genes of interest. The tool is open-source and its source code and software package are available through the provided link.

BIOINFORMATICS (2023)

Article Multidisciplinary Sciences

scAEGAN: Unification of single-cell genomics data by adversarial learning of latent space correspondences

Sumeer Ahmad Khan, Robert Lehmann, Xabier Martinez-de-Morentin, Alberto Maillo, Vincenzo Lagani, Narsis A. Kiani, David Gomez-Cabrero, Jesper Tegner

Summary: Recent progress in Single-Cell Genomics has led to the development of different library protocols and techniques for molecular profiling. We have formulated a novel methodology called scAEGAN that integrates and predicts data from various libraries, samples, and data modalities. Through evaluations on simulated and real datasets, scAEGAN has demonstrated superior performance in library integration, data sparsity robustness, and paired data integration compared to Seurat3 and Seurat4. Furthermore, scAEGAN outperforms Babel in predictive analysis between different data modalities. Overall, scAEGAN surpasses current state-of-the-art methods and addresses the challenges of integration and prediction in single-cell genomics.

PLOS ONE (2023)

Article Biochemistry & Molecular Biology

Exploration and comparison of molecular mechanisms across diseases using MINERVA Net

Piotr Gawron, Ewa Smula, Reinhard Schneider, Marek Ostaszewski

Summary: MINERVA Net is an open-access repository of disease maps that allows users to publicly share minimal information about their maps. This article introduces the concept of MINERVA Net and demonstrates its use by comparing proteins and their interactions in three different disease maps.

PROTEIN SCIENCE (2023)

Article Mathematical & Computational Biology

Approximations of algorithmic and structural complexity validate cognitive-behavioral experimental results

Hector Zenil, James A. R. Marshall, Jesper Tegner

Summary: This study uses numerical approximation methods to objectively classify and evaluate animal behavior based on algorithmic and structural complexity. The research finds that animal behavior is influenced by environmental conditions and exhibits algorithmic biases in decision-making and reasoning processes. Additionally, experiments on human perception of randomness suggest the existence of algorithmic biases in human cognition.

FRONTIERS IN COMPUTATIONAL NEUROSCIENCE (2023)

Article Multidisciplinary Sciences

Forecasting individual progression trajectories in Alzheimer's disease

Etienne Maheux, Igor Koval, Juliette Ortholand, Colin Birkenbihl, Damiano Archetti, Vincent Bouteloup, Stephane Epelbaum, Carole Dufouil, Martin Hofmann-Apitius, Stanley Durrleman

Summary: This study developed a statistical model, AD Course Map, for predicting the progression of Alzheimer's disease (AD) based on current medical and radiological data. The model was tested on a large dataset of over 96,000 cases and showed high accuracy in predicting clinical endpoints. By enriching the population with predicted progressors, the required sample size for trials could be reduced by 38% to 50%.

NATURE COMMUNICATIONS (2023)

Article Biochemistry & Molecular Biology

DNA Methylation Signatures of Multiple Sclerosis Occur Independently of Known Genetic Risk and Are Primarily Attributed to B Cells and Monocytes

Alexandre Xavier, Vicki E. Maltby, Ewoud Ewing, Maria Pia Campagna, Sean M. Burnard, Jesper N. Tegner, Mark Slee, Helmut Butzkueven, Ingrid Kockum, Lara Kular, Vilija G. Jokubaitis, Trevor Kilpatrick, Lars Alfredsson, Maja Jagodic, Anne-Louise Ponsonby, Bruce V. Taylor, Rodney J. Scott, Rodney A. Lea, Jeannette Lechner-Scott

Summary: This study reveals that DNA methylation differences in multiple sclerosis (MS) occur independently of known genetic risk loci. It shows that these differences more effectively differentiate the disease compared to known genetic risk loci. The study also indicates that the methylation differences in MS predominantly occur in B cells and monocytes, involving cell-specific biological pathways.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2023)

Article Cell & Tissue Engineering

Generation of myoglobin (MB)-knockout human embryonic stem cell (hESC) line (KAIMRCe002-A-1S) using CRISPR/Cas9 technology

Maryam Alowaysi, Mohammad Al-Shehri, Moayad Baadhaim, Hajar AlZahrani, Doaa Aboalola, Mustafa Daghestani, Heba Hashem, Rawan Aljahdali, Rayan Salem, Adel Alharbi, Mohammed Muharraq, Khaled Alghamdi, Fawaz Alsobiy, Asima Zia, Robert Lehmann, Jesper Tegner, Khaled Alsayegh

Summary: Myoglobin (MB) is expressed in the heart and skeletal muscle, and it promotes oxygen diffusion for energy production. Recent research has shown that MB is also expressed in various malignant tumors and cancer cells. Further studies using gene disruption technology will enhance our understanding of MB's role in human biology and diseases.

STEM CELL RESEARCH (2023)

Article Multidisciplinary Sciences

Multi-ontology embeddings approach on human-aligned multi-ontologies representation for gene-disease associations prediction

Yihao Wang, Philipp Wegner, Daniel Domingo-Fernandez, Alpha Tom Kodamullil

Summary: This study investigates the effectiveness of infusing the embedding of two aligned ontologies as prior knowledge into NLP models, and demonstrates through experiments that the knowledge-infused model slightly outperforms the model without incorporating contextualized knowledge in gene-disease association prediction tasks. However, further research is needed to explore the generalizability of the model, and adding more bridges is expected to bring further improvement based on the observed trend.

HELIYON (2023)

Article Computer Science, Artificial Intelligence

Translating single-cell genomics into cell types

Jesper N. Tegner

Summary: Machine translation now enables automatic detection of different cell types from single-cell transcriptomic data, allowing for the potential to dissect complex clinical samples like heterogeneous tumors at a large scale.

NATURE MACHINE INTELLIGENCE (2023)

Article Computer Science, Artificial Intelligence

Reusability report: Learning the transcriptional grammar in single-cell RNA-sequencing data using transformers

Sumeer Ahmad Khan, Alberto Maillo, Vincenzo Lagani, Robert Lehmann, Narsis A. Kiani, David Gomez-Cabrero, Jesper Tegner

Summary: The rise of single-cell genomics has provided an opportunity for machine learning algorithms. The scBERT method, inspired by the success of BERT in natural language processing, is a data-driven tool for annotating cell types in single-cell genomics data. However, the imbalance in cell-type distribution significantly affects the performance of scBERT, and careful consideration of data distribution and the introduction of subsampling techniques are necessary.

NATURE MACHINE INTELLIGENCE (2023)

暂无数据