4.7 Article

ProSAR: A New Methodology for Combinatorial Library Design

Ask authors/readers for more resources

A method is introduced for performing reagent selection for chemical library design based on topological (2D) pharmacophore fingerprints. Optimal reagent selection is achieved by optimizing the Shannon entropy of the 2D pharmacophore distribution for the reagent set. The method, termed ProSAR, is therefore expected to enumerate compounds that could serve as a good starting point for deriving a structure activity relationship (SAR) in combinatorial library design. This methodology is exemplified by library design examples where the active compounds were already known. The results show that most of the pharmacophores on the substituents for the active compounds arc covered by the designed library. This strategy is further expanded to include product property profiles for aqueous solubility, hERG risk assessment, etc. in the optimization process so that the reagent pharmacophore diversity and the product property profile are optimized simultaneously via a genetic algorithm. This strategy is applied to a two-dimensional library design example and compared with libraries designed by a diversity based strategy which minimizes the average ensemble Tanimoto similarity. Our results show that by using the PSAR methodology, libraries can be designed with simultaneously good pharmacophore coverage and product property profile.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Chemistry, Medicinal

LibINVENT: Reaction-based Generative Scaffold Decoration for in Silico Library Design

Vendy Fialkova, Jiaxi Zhao, Kostas Papadopoulos, Ola Engkvist, Esben Jannik Bjerrum, Thierry Kogej, Atanas Patronov

Summary: The researchers have proposed a novel drug design tool called LibINVENT, capable of rapidly designing chemical libraries with shared cores and maximizing desirable properties. In addition, users can list specific chemical reactions for library creation, making this tool flexible for lead optimization.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2022)

Article Chemistry, Medicinal

Exploring Graph Traversal Algorithms in Graph-Based Molecular Generation

Rocio Mercado, Esben J. Bjerrum, Ola Engkvist

Summary: The impact of different graph traversal algorithms on molecular graph generation was explored in this study. Using a breadth-first traversal led to better coverage of training data features compared to a depth-first traversal, but overtraining can make the results with either graph traversal algorithm identical.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2022)

Article Infectious Diseases

Screening the pandemic response box identified benzimidazole carbamates, Olorofim and ravuconazole as promising drug candidates for the treatment of eumycetoma

Wilson Lim, Bertrand Nyuykonge, Kimberly Eadie, Mickey Konings, Juli Smeets, Ahmed Fahal, Alexandro Bonifaz, Matthew Todd, Benjamin Perry, Kirandeep Samby, Jeremy Burrows, Annelies Verbon, Wendy van de Sande

Summary: Eumycetoma is a chronic subcutaneous neglected tropical disease caused by various fungal agents. Current antifungal drugs have low cure rates, and there is a need for novel treatments. In this study, four compounds (fenbendazole, carbendazim, tafenoquine, and MMV1578570) were found to inhibit the growth of five species of fungi that cause eumycetoma. These compounds showed promising results in in vivo experiments as well.

PLOS NEGLECTED TROPICAL DISEASES (2022)

Editorial Material Infectious Diseases

Volunteer infection studies accelerate the clinical development of novel drugs against malaria

Claudia Daubenberger, Jeremy N. Burrows

LANCET INFECTIOUS DISEASES (2022)

Article Virology

The Antifungal Itraconazole Is a Potent Inhibitor of Chikungunya Virus Replication

Lucca R. Policastro, Isabela Dolci, Andre S. Godoy, Jose V. J. Silva Junior, Uriel E. A. Ruiz, Igor A. Santos, Ana C. G. Jardim, Kirandeep Samby, Jeremy N. Burrows, Timothy N. C. Wells, Laura H. V. G. Gil, Glaucius Oliva, Rafaela S. Fernandes

Summary: Chikungunya virus is the causative agent of chikungunya fever, a specific drug against this virus is needed for treatment.

VIRUSES-BASEL (2022)

Article Chemistry, Medicinal

De Novo Drug Design Using Reinforcement Learning with Graph- Based Deep Generative Models

Sara Romeo Atance, Juan Viguera Diez, Ola Engkvist, Simon Olsson, Rocio Mercado

Summary: This research proposes a new reinforcement learning scheme to fine-tune graph-based deep generative models for de novo molecular design. The proposed approach can successfully guide a pretrained generative model to generate molecules with specific properties, even if these molecules are not present in the training set.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2022)

Article Biology

DrugnomeAI is an ensemble machine-learning framework for predicting druggability of candidate drug targets

Arwa Raies, Ewa Tulodziecka, James Stainer, Lawrence Middleton, Ryan S. Dhindsa, Pamela Hill, Ola Engkvist, Andrew R. Harper, Slave Petrovski, Dimitrios Vitsios

Summary: In this study, a stochastic semi-supervised ML framework called DrugnomeAI was developed to estimate the druggability likelihood for every protein-coding gene in the human exome. The tool generates exome-wide predictions based on known drug targets and provides specialized models stratified by disease type or drug therapeutic modality. The results show enrichment of genes previously selected for clinical development programs, as well as genome-wide significance in phenome-wide association studies.

COMMUNICATIONS BIOLOGY (2022)

Article Chemistry, Medicinal

AiZynthTrain: Robust, Reproducible, and Extensible Pipelines for Training Synthesis Prediction Models

Samuel Genheden, Per-Ola Norrby, Ola Engkvist

Summary: We present AiZynthTrain Python package that trains synthesis models in a robust, reproducible, and extensible way. It includes two pipelines for creating one-step retrosynthesis and RingBreaker models, which can be easily integrated into retrosynthesis software. By training on publicly available reaction data from USPTO, these are the first reproducible end-to-end retrosynthesis models. The pipeline demonstrates improved RingBreaker performance and robustness when trained on a more diverse proprietary dataset. This framework is expected to be expanded with other synthesis models in the future.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Chemistry, Medicinal

Blinded Predictions and Post Hoc Analysis of the Second Solubility Challenge Data: Exploring Training Data and Feature Set Selection for Machine and Deep Learning Models

Jonathan G. M. Conn, James W. Carter, Justin J. A. Conn, Vigneshwari Subramanian, Andrew Baxter, Ola Engkvist, Antonio Llinas, Ekaterina L. Ratkova, Stephen D. Pickett, James L. McDonagh, David S. Palmer

Summary: Accurate prediction of solubility is highly desirable in the field of chemical sciences. In 2019, the American Chemical Society organized a Solubility Challenge to evaluate the state of the art in this area. This article describes the development of two models submitted to the challenge, based on computationally inexpensive molecular descriptors and traditional machine learning algorithms. The performance of these models is compared to more advanced algorithms and larger training data sets, revealing potential areas for improvement in solubility prediction models.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Chemistry, Physical

Understanding Intracellular Biology to Improve mRNA Delivery by Lipid Nanoparticles

Morag Rose Hunter, Lili Cui, Benjamin Thomas Porebski, Sara Pereira, Silvia Sonzini, Uchechukwu Odunze, Preeti Iyer, Ola Engkvist, Rebecca Louise Lloyd, Samantha Peel, Alan Sabirsh, Douglas Ross-Thriepland, Arwyn Tomos Jones, Arpan Shailesh Desai

Summary: This study utilized siRNA and small molecule profiling approaches along with advanced imaging and machine learning techniques to explore the mechanism of lipid nanoparticle delivery of mRNA. By analyzing data-rich phenotypic fingerprints extracted from images, the researchers identified fluid-phase endocytosis as a productive cellular entry route for enhanced delivery. The re-engineered nanoparticle targeting macropinocytosis significantly improved mRNA delivery in vitro and in vivo.

SMALL METHODS (2023)

Correction Biology

DrugnomeAI is an ensemble machine-learning framework for predicting druggability of candidate drug targets (vol 5, 1291, 2022)

Arwa Raies, Ewa Tulodziecka, James Stainer, Lawrence Middleton, Ryan S. S. Dhindsa, Pamela Hill, Ola Engkvist, Andrew R. R. Harper, Slave Petrovski, Dimitrios Vitsios

COMMUNICATIONS BIOLOGY (2023)

Article Chemistry, Multidisciplinary

De novo generated combinatorial library design

Simon Viet Johansson, Morteza Haghir Chehreghani, Ola Engkvist, Alexander Schliep

Summary: Artificial intelligence (AI) offers new approaches to design compounds in drug discovery, such as suggesting new molecular structures or optimizing existing leads. However, the lack of high-quality data sets limits the effectiveness of AI methods. This study proposes a framework for designing combinatorial libraries using a molecular generative model and optimization algorithms. Simulation experiments show that near-optimal library designs can be achieved even without synthesizing additional building blocks.

DIGITAL DISCOVERY (2023)

Article Chemistry, Multidisciplinary

Link-INVENT: generative linker design with reinforcement learning

Jeff Guo, Franziska Knuth, Christian Margreitter, Jon Paul Janet, Kostas Papadopoulos, Ola Engkvist, Atanas Patronov

Summary: This work presents Link-INVENT as an extension to the existing de novo molecular design platform REINVENT. Illustrative examples demonstrate its applications in fragment linking, scaffold hopping, and PROTAC design case studies. Link-INVENT, with the help of reinforcement learning, allows the agent to generate favorable linkers that connect molecular subunits satisfying diverse objectives, making it practical for real-world drug discovery projects.

DIGITAL DISCOVERY (2023)

Correction Computer Science, Artificial Intelligence

Improving de novo molecular design with curriculum learning (vol 4, pg 555, 2022)

Jeff Guo, Vendy Fialkova, Juan Diego Arango, Christian Margreitter, Jon Paul Janet, Kostas Papadopoulos, Ola Engkvist, Atanas Patronov

NATURE MACHINE INTELLIGENCE (2022)

Article Computer Science, Artificial Intelligence

Improving de novo molecular design with curriculum learning

Jeff Guo, Vendy Fialkova, Juan Diego Arango, Christian Margreitter, Jon Paul Janet, Kostas Papadopoulos, Ola Engkvist, Atanas Patronov

Summary: Guo and colleagues introduce curriculum learning extension to REINVENT, a de novo molecular design framework, which improves training efficiency by providing increasingly difficult problems over epochs.

NATURE MACHINE INTELLIGENCE (2022)

No Data Available