Article

Interoperable chemical structure search service

Journal

JOURNAL OF CHEMINFORMATICS
Volume 11

Publisher

BMC
DOI: 10.1186/s13321-019-0367-2

Keywords

Substructure search; Small molecule databases; Interoperability; Linked data

Funding

  1. ELIXIR CZ (MEYS) [LM2015047]
  2. Institute of Organic Chemistry and Biochemistry of the CAS [61388963]

Motivation

The existing connections between large databases of chemicals, proteins, metabolites and assays offer valuable resources for research in fields ranging from drug design to metabolomics. Transparent search across multiple databases provides a way to efficiently utilize these resources. To simplify such searches, many databases have adopted semantic technologies that allow interoperable querying of the datasets using SPARQL query language. However, the interoperable interfaces of the chemical databases still lack the functionality of structure-driven chemical search, which is a fundamental method of data discovery in the chemical search space.

Results

We present a SPARQL service that augments existing semantic services by making interoperable substructure and similarity searches in small-molecule databases possible. The service thus offers new possibilities for querying interoperable databases, and simplifies writing of heterogeneous queries that include chemical-structure search terms.

Availability

The service is freely available and accessible using a standard SPARQL endpoint interface. The service documentation and user-oriented demonstration interfaces that allow quick explorative querying of datasets are available at https://idsm.elixir-czech.cz.
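
As a rough illustration of what "accessible using a standard SPARQL endpoint interface" can mean in practice, the sketch below builds (but does not send) an HTTP GET request in the style of the SPARQL 1.1 Protocol, with the query passed in the `query` parameter. The endpoint address `https://example.org/sparql`, the `ex:` prefix and the `ex:substructureSearch` predicate are placeholders invented for this example; the actual endpoint address and search vocabulary are described in the service documentation and may look quite different.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Placeholder endpoint (assumption); see the service documentation for the real one.
EXAMPLE_ENDPOINT = "https://example.org/sparql"


def build_substructure_query(smiles: str, limit: int = 10) -> str:
    """Return a SPARQL query string using a hypothetical search predicate."""
    # Escape backslashes and double quotes so the SMILES stays a valid literal.
    literal = smiles.replace("\\", "\\\\").replace('"', '\\"')
    return (
        "PREFIX ex: <http://example.org/chem#>\n"  # hypothetical vocabulary
        "SELECT ?compound WHERE {\n"
        f'  ?compound ex:substructureSearch "{literal}" .\n'
        "}\n"
        f"LIMIT {int(limit)}"
    )


def build_request(endpoint: str, smiles: str, limit: int = 10) -> Request:
    """Build a GET request carrying the query in the 'query' URL parameter."""
    params = urlencode({"query": build_substructure_query(smiles, limit)})
    return Request(
        f"{endpoint}?{params}",
        # Ask for the standard JSON serialization of SPARQL results.
        headers={"Accept": "application/sparql-results+json"},
    )


if __name__ == "__main__":
    # Benzene ring as an example substructure pattern.
    request = build_request(EXAMPLE_ENDPOINT, "c1ccccc1")
    print(request.full_url)
```

Passing the resulting request to `urllib.request.urlopen` would send it; that step is left out here because the placeholder endpoint does not exist.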
