4.7 Article

Interoperable and scalable data analysis with microservices: applications in metabolomics

Journal

BIOINFORMATICS
Volume 35, Issue 19, Pages 3752-3760

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/bioinformatics/btz160

Funding

  1. European Commission [654241]
  2. Swedish Research Council FORMAS
  3. Uppsala Berzelii Technology Centre for Neurodiagnostics
  4. Ake Wiberg Foundation
  5. Nordic eInfrastructure Collaboration (NeIC) via the Glenna2 project
  6. Nordic eInfrastructure Collaboration (NeIC) via the Tryggve2 project
  7. BBSRC [BB/L024055/1, BB/M027635/1, BB/L024101/1, BB/H024921/1, BB/J020265/1, BB/I000771/1, BB/I025840/1] Funding Source: UKRI

Motivation: Developing a robust and performant data analysis workflow that integrates all necessary components whilst still being able to scale over multiple compute nodes is a challenging task. We introduce a generic method based on the microservice architecture, where software tools are encapsulated as Docker containers that can be connected into scientific workflows and executed using the Kubernetes container orchestrator.

Results: We developed a Virtual Research Environment (VRE) which facilitates rapid integration of new tools and developing scalable and interoperable workflows for performing metabolomics data analysis. The environment can be launched on-demand on cloud resources and desktop computers. IT-expertise requirements on the user side are kept to a minimum, and workflows can be re-used effortlessly by any novice user. We validate our method in the field of metabolomics on two mass spectrometry, one nuclear magnetic resonance spectroscopy and one fluxomics study. We showed that the method scales dynamically with increasing availability of computational resources. We demonstrated that the method facilitates interoperability using integration of the major software suites resulting in a turn-key workflow encompassing all steps for mass-spectrometry-based metabolomics including preprocessing, statistics and identification. Microservices is a generic methodology that can serve any scientific discipline and opens up for new types of large-scale integrative science.
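As a rough illustration of the architecture the abstract describes (each tool packaged as a container image and run as a unit of work by Kubernetes), the sketch below builds one Kubernetes Job manifest per workflow step using only the Python standard library. It is not the authors' implementation: the image names, commands, data paths, and the helper names `tool_job` and `build_workflow` are hypothetical placeholders, and the step names simply echo the preprocessing, statistics and identification stages mentioned in the abstract.

```python
import json


def tool_job(step_name, image, command):
    """Return a Kubernetes Job manifest (plain dict) that runs one containerized tool."""
    return {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {"name": step_name},
        "spec": {
            "template": {
                "spec": {
                    "containers": [
                        {"name": step_name, "image": image, "command": command}
                    ],
                    # Run the tool once; do not restart the container on exit.
                    "restartPolicy": "Never",
                }
            }
        },
    }


# Hypothetical workflow: every image name, command and path is a placeholder.
WORKFLOW = [
    ("preprocessing", "example.org/tools/preprocess:latest",
     ["preprocess", "--in", "/data/raw", "--out", "/data/peaks"]),
    ("statistics", "example.org/tools/stats:latest",
     ["stats", "--in", "/data/peaks", "--out", "/data/stats"]),
    ("identification", "example.org/tools/identify:latest",
     ["identify", "--in", "/data/stats", "--out", "/data/ids"]),
]


def build_workflow(steps):
    """Build one Job manifest per step, preserving the workflow order."""
    return [tool_job(name, image, command) for name, image, command in steps]


if __name__ == "__main__":
    for manifest in build_workflow(WORKFLOW):
        print(json.dumps(manifest, indent=2))
```

Each printed manifest could be saved to a file and submitted with `kubectl apply -f`. Ordering the steps, passing data between them through shared storage, and scaling across nodes would be handled by a workflow engine and the cluster, none of which this sketch attempts to model.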

Recommended

Article Medical Laboratory Technology

Accuracy of determination of free light chains (Kappa and Lambda) in plasma and serum by Swedish laboratories as monitored by external quality assessment

Niclas Rollborn, Jenny Jakobsson, Andrew Campbell, Gunnar Nordin, Mathias Karlsson, Anders Larsson, Kim Kultima

Summary: The study found significant deviations in FLC measurements among different methods, with lower deviation when only nephelometry was used. There were significant differences in coefficient of variation between the two main FLC assays, and generally good coefficient of determination between reagents and instrument platforms.

CLINICAL BIOCHEMISTRY (2023)

Article Chemistry, Analytical

Reconstruction of Glutathione Metabolism in the Neuronal Model of Rotenone-Induced Neurodegeneration Using Mass Isotopologue Analysis with Hydrophilic Interaction Liquid Chromatography-Zeno High-Resolution Multiple Reaction Monitoring

Thomas Hankemeier, Luojiao Huang, Nicolas Drouin, Jason Causon, Agnieszka Wegrzyn, Jose Castro-Perez, Ronan Fleming, Amy Harms

Summary: Accurate reconstruction of metabolic pathways is crucial for understanding metabolomics changes and biological processes in diseases. A tracer-based metabolomics strategy using stable isotope labeled precursors can trace pathways by measuring the transformation of metabolites. By quantifying labeled metabolite substructures, a new method achieves simultaneous isotopic labeling information at the intact metabolite and moiety level. This method was applied to trace the fate of labeled atoms in human-induced pluripotent stem cell-derived neurons, revealing the pathway reconstruction of de novo glutathione synthesis and its alteration under oxidative stress and neurodegeneration.

ANALYTICAL CHEMISTRY (2023)

Article Behavioral Sciences

Integrative Multi-omics Analysis of Childhood Aggressive Behavior

Fiona A. Hagenbeek, Jenny van Dongen, Rene Pool, Peter J. Roetman, Amy C. Harms, Jouke Jan Hottenga, Cornelis Kluft, Olivier F. Colins, Catharina E. M. van Beijsterveldt, Vassilios Fanos, Erik A. Ehli, Thomas Hankemeier, Robert R. J. M. Vermeiren, Meike Bartels, Sebastien Dejean, Dorret Boomsma

Summary: This study introduces and demonstrates the potential of an integrated multi-omics approach in investigating the biology of childhood aggressive behavior. By using single- and integrative multi-omics models, the researchers identified biomarkers for subclinical aggression and studied the connections among these biomarkers. The study found strong associations between DNA methylation, amino acids, and parental non-transmitted polygenic scores with traits like ADHD, Autism Spectrum Disorder, intelligence, smoking initiation, and self-reported health. Aggression-related omics traits were also linked to known and novel risk factors such as inflammation, carcinogens, and smoking.

BEHAVIOR GENETICS (2023)

Article Clinical Neurology

Antibody-Positive Autoimmune Encephalitis and Paraneoplastic Neurological Syndrome: Epidemiology and Outcome of Neuronal Antibody Testing in Sweden

Sonja Kosek, Barbro Persson, Rui Rodrigues, Clas Malmestrom, Anna Rostedt Punga, Joachim Burman

Summary: This study aimed to estimate the 5-year incidence rate of autoimmune encephalitis (AE) and paraneoplastic neurological syndrome (PNS) in Sweden. The results showed that the incidence rate of AE and PNS doubled from 2015 to 2019.

ACTA NEUROLOGICA SCANDINAVICA (2023)

Article Biochemistry & Molecular Biology

Open data and algorithms for open science in AI-driven molecular informatics

Henning Otto Brinkhaus, Kohulan Rajan, Jonas Schaub, Achim Zielesny, Christoph Steinbeck

Summary: Recent years have witnessed a significant growth in the development of deep learning and AI-based molecular informatics. Although there is increasing interest in applying deep learning to various aspects of molecular informatics, the lack of FAIR and open data poses a constraint on the application of AI in this field. However, with the rise of open science practices and initiatives supporting open data and software, researchers in molecular informatics are encouraged to embrace open science and contribute to open repositories. The combination of open-source deep learning frameworks, cloud computing platforms, and a culture promoting open science provides opportunities for the continued growth of AI-driven molecular informatics.

CURRENT OPINION IN STRUCTURAL BIOLOGY (2023)

Review Plant Sciences

Advances in single-cell metabolomics to unravel cellular heterogeneity in plant biology

Kanchana Pandian, Minami Matsui, Thomas Hankemeier, Ahmed Ali, Emiko Okubo-Kurihara

Summary: Single-cell metabolomics is a powerful tool for understanding cellular heterogeneity and unraveling the mechanisms of biological phenomena. It holds great promise in plant research, especially when cellular heterogeneity affects different biological processes. Metabolomics, as a detailed phenotypic analysis, is expected to provide answers to previously unaddressed questions, leading to improved crop production, better understanding of disease resistance, and other applications.

PLANT PHYSIOLOGY (2023)

Article Multidisciplinary Sciences

Anti-Mullerian hormone and pregnancy after autologous hematopoietic stem cell transplantation for multiple sclerosis

Lida Zafeiri, Torbjoern Akerfeldt, Andreas Tolf, Kristina Carlson, Alkistis Skalkidou, Joachim Burman

Summary: This study investigated the relationship between AMH levels and age and reproductive potential in MS patients treated with AHSCT. The results showed that although AMH concentration significantly decreased after AHSCT, six patients successfully conceived despite low concentrations, suggesting that high-dose cyclophosphamide treatment may not negatively impact fertility.

PLOS ONE (2023)

Article Chemistry, Multidisciplinary

MAW: the reproducible Metabolome Annotation Workflow for untargeted tandem mass spectrometry

Mahnoor Zulfiqar, Luiz Gadelha, Christoph Steinbeck, Maria Sorokina, Kristian Peters

Summary: Mapping the chemical space of compounds to chemical structures remains a challenge in metabolomics. Many novel computational methods and tools have been developed to enable chemical structure annotation to known and unknown compounds. Here, we present an automated and reproducible Metabolome Annotation Workflow (MAW) for untargeted metabolomics data. MAW combines tandem mass spectrometry input data pre-processing, spectral and compound database matching, computational classification, and in silico annotation to facilitate and automate complex annotation.

JOURNAL OF CHEMINFORMATICS (2023)

Article Multidisciplinary Sciences

Disease phenotype prediction in multiple sclerosis

Stephanie Herman, Staffan Arvidsson McShane, Christina Zjukovskaja, Payam Emami Khoonsari, Anders Svenningsson, Joachim Burman, Ola Spjuth, Kim Kultima

Summary: Currently, there is a need for biomarkers to assist in early diagnosis of progressive multiple sclerosis (PMS). Research has shown that a selection of cerebrospinal fluid metabolites can differentiate between PMS and its preceding phenotype. By using predictive methods, highly confident predictions can be made for patients who will develop PMS within three years. In a clinical trial, the methodology was applied to PMS patients receiving intrathecal treatment, and it was found that 68% of the patients decreased their similarity to the PMS phenotype after one year of treatment.

ISCIENCE (2023)

Article Biology

Control analysis in the identification of key enzymes driving metabolic adaptations: Towards drug target discovery

Pedro de Atauri, Carles Foguet, Marta Cascante

Summary: Metabolic Control Analysis (MCA) has revealed that control of metabolic pathways is distributed among many enzymes and depends on kinetic determinants in addition to stoichiometric structure. By incorporating kinetic determinants and ruling out enzymes with low control coefficients, MCA can improve the prediction and identification of therapeutic targets in drug discovery.

BIOSYSTEMS (2023)

Article Biochemical Research Methods

Development of a targeted hydrophilic interaction liquid chromatography-tandem mass spectrometry based lipidomics platform applied to a coronavirus disease severity study

Zhengzheng Zhang, Madhulika Singh, Alida Kindt, Agnieszka B. Wegrzyn, Mackenzie J. Pearson, Ahmed Ali, Amy C. Harms, Paul Baker, Thomas Hankemeier

Summary: The importance of lipidomics research in understanding various diseases, including metabolism, cancer, and the recent COVID-19 pandemic, has led to the development of a targeted HILIC-MS/MS method that allows for comprehensive analysis of lipid species. This method overcomes the challenges posed by the diverse structures and properties of lipids in biological systems, and provides accurate quantitation of lipid concentrations at the fatty acyl chain level. The applicability of this method has been demonstrated through the discovery of differential lipid features related to COVID-19 severity, highlighting its potential for future investigations of the lipidome in different disease contexts.

JOURNAL OF CHROMATOGRAPHY A (2023)

Article Clinical Neurology

Trajectories of cognitive processing speed and physical disability over 11 years following initiation of a first multiple sclerosis disease-modulating therapy

Elisa Longinetti, Simon Englund, Joachim Burman, Katharina Fink, Anna Fogdell-Hahn, Martin Gunnarsson, Jan Hillert, Annette Magdalene Langer-Gould, Jan Lycke, Petra Nilsson, Jonatan Salzer, Anders Svenningsson, Johan Mellergard, Tomas Olsson, Fredrik Piehl, Thomas Frisell

Summary: This study analyzed a Swedish nationwide observational study on RRMS to identify trajectories of processing speed and physical disability after DMT start. The results showed that patients' processing speed remained stable over time, while those with moderate physical disability experienced deterioration in physical function. However, there was a strong association between processing speed and disability.

JOURNAL OF NEUROLOGY NEUROSURGERY AND PSYCHIATRY (2023)

Article Clinical Neurology

Antibodies from serum and CSF of multiple sclerosis patients bind to oligodendroglial and neuronal cell-lines

Faisal Hayat Nazir, Anna Wiberg, Malin Mueller, Sara Mangsbo, Joachim Burman

Summary: Multiple sclerosis is a complex and heterogeneous disease that often starts as a clinically isolated syndrome. Autoantibodies play an important role in its pathogenesis, but their target has been difficult to identify. Cell-based methods have been developed as an alternative strategy for detecting autoantibodies. This study explored differences in antibody binding to oligodendroglial and neuronal cell-lines in serum and CSF samples from multiple sclerosis patients and controls, and found that the binding of immunoglobulin G from CSF to the human oligodendroglioma cell-line was the best discriminator between patients and controls, with a high sensitivity and specificity. The cell-based ELISA showed a high degree of accuracy in discriminating between multiple sclerosis patients and controls, with the disease course being the major determinant for antibody binding.

BRAIN COMMUNICATIONS (2023)

Article Chemistry, Multidisciplinary

Twenty years of nmrshiftdb2: A case study of an open database for analytical chemistry

Stefan Kuhn, Heinz Kolshorn, Christoph Steinbeck, Nils Schloerer

Summary: NMRshiftDB, now renamed as nmrshiftdb2, is a long-running open-source and open-content database in the field of open data in chemistry. After 20 years, the success of the project is evaluated, and lessons learned are presented for similar projects.

MAGNETIC RESONANCE IN CHEMISTRY (2023)
