4.3 Article

Consensus QSAR models estimating acute toxicity to aquatic organisms from different trophic levels: algae, Daphnia and fish

Journal

SAR AND QSAR IN ENVIRONMENTAL RESEARCH
Volume 31, Issue 9, Pages 655-675

Publisher

TAYLOR & FRANCIS LTD
DOI: 10.1080/1062936X.2020.1797872

Keywords

QSAR/QSPR; acute aquatic toxicity; generative topographic mapping (GTM); REACH

Ask authors/readers for more resources

We report new consensus models estimating acute toxicity for algae,Daphniaand fish endpoints. We assembled a large collection of 3680 public unique compounds annotated by, at least, one experimental value for the given endpoint. Support Vector Machine models were internally and externally validated following the OECD principles. Reasonable predictive performances were achieved (RMSEext = 0.56-0.78) which are in line with those of state-of-the-art models. The known structural alerts are compared with analysis of the atomic contributions to these models obtained using the ISIDA/ColorAtom utility. A benchmarking against existing tools has been carried out on a set of compounds considered more representative and relevant for the chemical space of the current chemical industry. Our model scored one of the best accuracy and data coverage. Nevertheless, industrial data performances were noticeably lower than those on public data, indicating that existing models fail to meet the industrial needs. Thus, final models were updated with the inclusion of new industrial compounds, extending the applicability domain and relevance for application in an industrial context. Generated models and collected public data are made freely available.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Biochemical Research Methods

HIV-1 drug resistance profiling using amino acid sequence space cartography

Karina Pikalyova, Alexey Orlov, Arkadii Lin, Olga Tarasova, Marcou Gilles, Dragos Horvath, Vladimir Poroikov, Alexandre Varnek

Summary: A new methodology based on generative topographic mapping (GTM) was introduced for predicting the drug resistance of HIV strains. The approach combines high accuracy and interpretability, allowing for visualization and analysis of sequence space and treatment optimization. Several case studies demonstrate the practicality of this method.

BIOINFORMATICS (2022)

Article Chemistry, Medicinal

SynthI: A New Open-Source Tool for Synthon-Based Library Design

Yuliana Zabolotna, Dmitriy M. Volochnyuk, Sergey Ryabukhin, Kostiantyn Gavrylenko, Dragos Horvath, Olga Klimchuk, Oleksandr Oksiuta, Gilles Marcou, Alexandre Varnek

Summary: Most existing computational tools for de novo library design focus on generating, selecting, and combining structural motifs to form new library members. However, these approaches appear to be more theoretical and disconnected from reality due to the lack of a direct link between the chemical space of the retrosynthesized fragments and the pool of available reagents. This paper presents a new open-source toolkit called Synthons Interpreter (SynthI), which merges these two chemical spaces into a single synthons space.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2022)

Article Chemistry, Medicinal

A Close-up Look at the Chemical Space of Commercially Available Building Blocks for Medicinal Chemistry

Yuliana Zabolotna, Dmitriy M. Volochnyuk, Sergey V. Ryabukhin, Dragos Horvath, Konstantin S. Gavrilenko, Gilles Marcou, Yurii S. Moroz, Oleksandr Oksiuta, Alexandre Varnek

Summary: Efficient synthesis of desired compounds is crucial for chemical space exploration in drug discovery, which is influenced by both established synthetic protocols and the availability of corresponding building blocks (BBs). This study analyzes the chemical space of 400,000 purchasable BBs, examining their physicochemical properties and diversity to assess their coverage of medicinal chemistry needs. The analysis is based on a universal topographic map that visualizes libraries and their differences in coverage.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2022)

Article Chemistry, Medicinal

Exploration of the Chemical Space of DNA-encoded Libraries

Regina Pikalyova, Yuliana Zabolotna, Dmitriy M. Volochnyuk, Dragos Horvath, Gilles Marcou, Alexandre Varnek

Summary: DNA-Encoded Library (DEL) technology is a method for discovering bioactive molecules in medicinal chemistry. This project aimed to generate and analyze an ultra-large chemical space of DEL using commercially available building blocks. The study compared the DEL compounds to biologically relevant compounds from ChEMBL and identified optimal DELs covering the chemical space of ChEMBL. Different combinations of DELs were analyzed to achieve even higher coverage of ChEMBL than with a single DEL.

MOLECULAR INFORMATICS (2022)

Article Biochemistry & Molecular Biology

Molecular Similarity Perception Based on Machine-Learning Models

Enrico Gandini, Gilles Marcou, Fanny Bonachera, Alexandre Varnek, Stefano Pieraccini, Maurizio Sironi

Summary: Molecular similarity is a widely applicable and significant topic in chemistry. It has various applications in pharmaceutical research, such as studying structure-activity relationships, and in determining the status of orphan drugs. Models were built and evaluated using expert judgments to reduce human efforts. A dataset of new molecules and an online tool for molecular similarity estimation have been made freely available.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2022)

Article Biochemistry & Molecular Biology

Rapid Discrimination of Neuromyelitis Optica Spectrum Disorder and Multiple Sclerosis Using Machine Learning on Infrared Spectra of Sera

Youssef El Khoury, Marie Gebelin, Jerome de Seze, Christine Patte-Mensah, Gilles Marcou, Alexandre Varnek, Ayikoe-Guy Mensah-Nyagan, Petra Hellwig, Nicolas Collongues

Summary: Neuromyelitis optica spectrum disorder (NMOSD) and multiple sclerosis (MS) are autoimmune inflammatory and demyelinating diseases of the central nervous system. Differentiating between NMOSD and MS is crucial for the timely and appropriate treatment. In this study, infrared spectroscopy combined with machine learning successfully distinguished NMOSD and MS patients with 100% sensitivity and specificity. The method shows promise as a cost-effective and rapid diagnostic tool to help save valuable treatment time for patients.

INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES (2022)

Article Chemistry, Multidisciplinary

Toward in Silico Modeling of Dynamic Combinatorial Libraries

Iuri Casciuc, Artem Osypenko, Bohdan Kozibroda, Dragos Horvath, Gilles Marcou, Fanny Bonachera, Alexandre Varnek, Jean-Marie Lehn

Summary: This study proposes a chemoinformatic model for assessing the composition of dynamic combinatorial libraries (DCLs) theoretically. The model predicts the constants for DCL constituents and provides a virtual illustration on how effector affinity regulates DCL members.

ACS CENTRAL SCIENCE (2022)

Article Chemistry, Multidisciplinary

Computational screening methodology identifies effective solvents for CO2 capture

Alexey A. Orlov, Alain Valtz, Christophe Coquelet, Xavier Rozanska, Erich Wimmer, Gilles Marcou, Dragos Horvath, Benedicte Poulain, Alexandre Varnek, Frederick de Meyer

Summary: Researchers have developed a computational approach that combines kinetic experiments, molecular simulations, and machine learning to identify a class of tertiary amines that absorb CO2 faster than a typical commercial solvent when mixed with piperazine.

COMMUNICATIONS CHEMISTRY (2022)

Article Chemistry, Medicinal

Chemspace Atlas: Multiscale Chemography of Ultralarge Libraries for Drug Discovery

Yuliana Zabolotna, Fanny Bonachera, Dragos Horvath, Arkadii Lin, Gilles Marcou, Olga Klimchuk, Alexandre Varnek

Summary: Nowadays, drug discovery requires understanding of chemotype composition and physicochemical properties of large compound collections. ChemSpace Atlas is a freely accessible tool that allows multifaceted analysis and visualization of chemical libraries, supporting activity profiling and analogue search. It will be expanded to include new chemical subspaces and functionalities.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2022)

Article Biochemistry & Molecular Biology

In Vitro Evaluation of In Silico Screening Approaches in Search for Selective ACE2 Binding Chemical Probes

Alexey Rayevsky, Andrii S. Poturai, Iryna O. Kravets, Alexander E. Pashenko, Tatiana A. Borisova, Ganna M. Tolstanova, Dmitriy M. Volochnyuk, Petro O. Borysko, Olga B. Vadzyuk, Diana O. Alieksieieva, Yuliana Zabolotna, Olga Klimchuk, Dragos Horvath, Gilles Marcou, Sergey Ryabukhin, Alexandre Varnek

Summary: New models for ACE2 receptor binding were developed using QSAR and docking algorithms, and the selectivity of ACE2-binding ligands towards NEP and ACE was evaluated. Virtual screening of the Enamine screening collection was conducted to find potential ACE2-chemical probes for studying SARS-CoV2-induced neurological disorders. An enzymology inhibition assay for ACE2 was optimized, and the predicted selective ACE2-binding molecules from QSAR modeling, docking, and ultrafast docking were screened in vitro, resulting in the identification of two novel chemotypes suitable for further optimization.

MOLECULES (2022)

Article Chemistry, Medicinal

Inverse QSAR: Reversing Descriptor-Driven Prediction Pipeline Using Attention-Based Conditional Variational Autoencoder

William Bort, Daniyar Mazitov, Dragos Horvath, Fanny Bonachera, Arkadii Lin, Gilles Marcou, Igor Baskin, Timur Madzhidov, Alexandre Varnek

Summary: This paper proposes a new attention-based conditional variational autoencoder neural network architecture to solve the inverse QSAR problem. Using this method, it is possible to generate novel compounds with desired activity, and these compounds exhibit acceptable druglikeness and synthetic accessibility. Pharmacophore and docking studies validate the accuracy of the activity predictions for some of these novel structures.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2022)

Article Chemistry, Medicinal

GENERA: A Combined Genetic/Deep-Learning Algorithm for Multiobjective Target-Oriented De Novo Design

Giuseppe Lamanna, Pietro Delre, Gilles Marcou, Michele Saviano, Alexandre Varnek, Dragos Horvath, Giuseppe Felice Mangiatordi

Summary: This study introduces a new de novo design algorithm called GENERA that combines the capabilities of a deep-learning algorithm for automated drug-like analogue design, called DeLA-Drug, with a genetic algorithm for generating molecules with desired target-oriented properties. GENERA was applied to the angiotensin-converting enzyme 2 (ACE2) target, and its ability to de novo design promising candidates was assessed using docking programs PLANTS and GLIDE. The study demonstrates that GENERA can effectively perform multiobjective optimization and generate focused libraries with better scores compared to a starting set of known ACE-2 binders.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Chemistry, Medicinal

Chemical Library Space: Definition and DNA-Encoded Library Comparison Study Case

Regina Pikalyova, Yuliana Zabolotna, Dragos Horvath, Gilles Marcou, Alexandre Varnek

Summary: The development of DNA-encoded library (DEL) technology has brought new challenges to the analysis of chemical libraries. This study introduces the concept of chemical library space (CLS) and compares four representations obtained using generative topographic mapping. These encodings allow for effective comparison of libraries and fine-tuning of matching criteria. The proposed CLS can be used for efficient analysis and selection of chemical libraries.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Chemistry, Medicinal

Meta-GTM: Visualization and Analysis of the Chemical Library Space

Regina Pikalyova, Yuliana Zabolotna, Dragos Horvath, Gilles Marcou, Alexandre Varnek

Summary: In chemical library analysis, it can be beneficial to describe libraries as individual items rather than collections of compounds. This is especially true for large non-selectable compound mixtures like DNA-encoded libraries (DELs). The chemical library space (CLS) is useful for managing a portfolio of libraries, similar to how chemical space (CS) helps manage portfolios of molecules. Mapping the CLS on meta-GTMs allows for analysis beyond pairwise library comparison, facilitating the selection of the most suitable libraries for specific projects.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Biochemistry & Molecular Biology

Comprehensive analysis of commercial fragment libraries

Julia Revillo Imbernon, Celien Jacquemard, Guillaume Bret, Gilles Marcou, Esther Kellenberger

Summary: The screening of fragment libraries is crucial in drug discovery, with the success depending on the quality and design of the library meeting specific research requirements. This study conducted an inventory of commercial fragment libraries and developed a methodology to classify any library based on its similarity, coverage, and structural features, leading to the creation of a model that considers fragment diversity and ease of interpretation.

RSC MEDICINAL CHEMISTRY (2022)

No Data Available