4.6 Review

Impact of Artificial Intelligence on Compound Discovery, Design, and Synthesis

Journal

ACS OMEGA
Volume 6, Issue 49, Pages 33293-33299

Publisher

AMER CHEMICAL SOC
DOI: 10.1021/acsomega.1c05512

Keywords

-

Ask authors/readers for more resources

Artificial intelligence is increasingly being investigated in the field of chemistry, particularly in medicinal chemistry. Traditional conservative attitudes in chemistry are slowly adapting to the use of AI technology to advance drug design and optimization.
As in other areas, artificial intelligence (AI) is heavily promoted in different scientific fields, including chemistry. Although chemistry traditionally tends to be a conservative field and slower than others to adapt new concepts, AI is increasingly being investigated across chemical disciplines. In medicinal chemistry, supported by computer-aided drug design and cheminformatics, computational methods have long been employed to aid in the search for and optimization of active compounds. We are currently witnessing a multitude of AI-related publications in the medicinal-chemistry-relevant literature and anticipate that the numbers will further increase. Often, advances through AI promoted in such reports are difficult to reconcile or remain questionable, which hampers the acceptance of computational work in interdisciplinary environments. Herein we attempt to highlight selected investigations in which AI has shown promise to impact medicinal chemistry in areas such as compound design and synthesis.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Editorial Material Chemistry, Medicinal

Chemical language models for applications in medicinal chemistry

Atsushi Yoshimori, Hengwei Chen, Juergen Bajorath

FUTURE MEDICINAL CHEMISTRY (2023)

Article Biochemistry & Molecular Biology

Explaining Accurate Predictions of Multitarget Compounds with Machine Learning Models Derived for Individual Targets

Alec Lamens, Jurgen Bajorath

Summary: Compounds with multiple targets (MT-CPDs) are important in drug discovery, and computational approaches have been used to design or identify them. Machine learning models have been derived to distinguish between MT-CPDs and compounds with single-target activity (ST-CPDs). Surprisingly, it was found that models derived for ST-CPDs can accurately predict MT-CPDs, revealing a relationship between the two types of compounds.

MOLECULES (2023)

Article Biochemistry & Molecular Biology

Isomeric Activity Cliffs-A Case Study for Fluorine Substitution of Aminergic G Protein-Coupled Receptor Ligands

Wojciech Pietrus, Rafal Kurczab, Dawid Warszycki, Andrzej J. J. Bojarski, Jurgen Bajorath

Summary: In this study, a total of 898 F-containing isomeric analog sets were identified and analyzed for structure-activity relationship (SAR) in the ChEMBL database. The results showed significant differences in affinity for some isomeric compounds against different aminergic GPCRs, and the change of fluorine position could lead to a significant change in potency. Additionally, a computational workflow was proposed to score the fluorine positions in the molecule.

MOLECULES (2023)

Article Pharmacology & Pharmacy

Evaluating the performance of machine-learning regression models for pharmacokinetic drug-drug interactions

Jaidip Gill, Marie Moullet, Anton Martinsson, Filip Miljkovic, Beth Williamson, Rosalinda H. Arends, Venkatesh Pilla Reddy

Summary: This study investigated the use of regression-based machine learning to predict changes in drug exposure caused by pharmacokinetic drug-drug interactions. The results show that changes in drug exposure can be predicted with reasonable accuracy using this method, which has potential applications in drug-drug interaction risk assessment.

CPT-PHARMACOMETRICS & SYSTEMS PHARMACOLOGY (2023)

Editorial Material Chemistry, Medicinal

Generative kinase inhibitor modeling viewed from a medicinal chemistry perspective

Juergen Bajorath

FUTURE MEDICINAL CHEMISTRY (2023)

Article Chemistry, Medicinal

Data-Driven Global Assessment of Protein Kinase Inhibitors with Emphasis on Covalent Compounds

Elena Xerxa, Filip Miljkovic, Juergen Bajorath

Summary: Large-scale analysis of public human and mouse protein kinase inhibitor (PKI) data revealed a significant number of human and mouse PKIs with reliable activity measurements. Human PKIs were found to exhibit coverage against a large number of kinases, with a substantial growth in the past years. Covalent PKIs containing acrylamide or heterocyclic urea warheads were particularly abundant, and they showed higher potency compared to structurally analogous PKIs. These findings have important implications for medicinal chemistry.

JOURNAL OF MEDICINAL CHEMISTRY (2023)

Article Biochemistry & Molecular Biology

Motif2Mol: Prediction of New Active Compounds Based on Sequence Motifs of Ligand Binding Sites in Proteins Using a Biochemical Language Model

Atsushi Yoshimori, Juergen Bajorath

Summary: In the field of drug design, very few studies have attempted the prediction of new active compounds from protein sequence data. This is mainly due to the challenging nature of this prediction task, as global protein sequence similarity has strong evolutionary and structural implications but is not directly related to ligand binding. However, the application of deep language models adapted from natural language processing provides new opportunities to attempt such predictions by linking amino acid sequences and chemical structures through textual molecular representations. In this study, a biochemical language model with a transformer architecture, named Motif2Mol, was introduced for the prediction of new active compounds based on sequence motifs of ligand binding sites. In a proof-of-concept application on inhibitors of more than 200 human kinases, Motif2Mol exhibited promising learning characteristics and an unprecedented ability to consistently reproduce known inhibitors of different kinases.

BIOMOLECULES (2023)

Article Biochemistry & Molecular Biology

Predicting Potent Compounds Using a Conditional Variational Autoencoder Based upon a New Structure-Potency Fingerprint

Tiago Janela, Kosuke Takeuchi, Juergen Bajorath

Summary: Prediction of potency of bioactive compounds is typically done using linear or nonlinear quantitative structure-activity relationship (QSAR) models. In this study, a novel approach called structure-potency fingerprint (SPFP) is introduced, which combines structural features and potency values into a single bit string representation. By using a conditional variational autoencoder (CVAE) with SPFPs, the potency module of test compounds can be accurately predicted using only their structure module. The SPFP-CVAE approach achieves comparable accuracy to support vector regression (SVR) and deep neural networks in predicting compounds' activity classes and potency values.

BIOMOLECULES (2023)

Article Computer Science, Artificial Intelligence

Interpretable bilinear attention network with domain adaptation improves drug-target prediction

Peizhen Bai, Filip Miljkovic, Bino John, Haiping Lu

Summary: DrugBAN is a deep bilinear attention network framework that explicitly learns the local interactions between drugs and targets, and adapts to out-of-distribution data. It achieves the best performance on three benchmark datasets compared to five state-of-the-art baseline models. The visualized bilinear attention map provides interpretable insights from prediction results.

NATURE MACHINE INTELLIGENCE (2023)

Article Chemistry, Medicinal

CIRCE: Web-Based Platform for the Prediction of Cannabinoid Receptor Ligands Using Explainable Machine Learning

Nicola Gambacorta, Fulvio Ciriaco, Nicola Amoroso, Cosimo Damiano Altomare, Ju''rgen Bajorath, Orazio Nicolotti

Summary: In this study, a CIRCE compound prediction platform was developed based on explainable machine learning to support the design of selective CB1R and CB2R ligands. The platform achieved about 80% accuracy in test calculations and provided rationalized explanations for the predictions. CIRCE is freely available as a web-based prediction platform.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Biochemistry & Molecular Biology

Explaining Multiclass Compound Activity Predictions Using Counterfactuals and Shapley Values

Alec Lamens, Juergen Bajorath

Summary: In pharmaceutical research, black box predictions from machine learning models hinder their application in guiding experimental work. This study presents a test system that combines two explainable artificial intelligence methods to better understand prediction outcomes and provide chemically intuitive explanations for model decisions.

MOLECULES (2023)

Article Biochemistry & Molecular Biology

Systematic Analysis of Covalent and Allosteric Protein Kinase Inhibitors

Elena Xerxa, Oliver Laufkoetter, Jurgen Bajorath

Summary: Protein kinase inhibitors (PKIs) are widely studied as drug candidates in drug discovery, with increasing interest in allosteric PKIs (APKIs) and covalent PKIs (CPKIs). The popularity of PKIs for therapeutic intervention has led to the accumulation of large volumes of activity data, enabling large-scale computational analysis. CPKIs with certain reactive groups showed significantly higher potency compared to non-covalent PKIs, but there was no general increase in promiscuity. Additionally, new APKIs have been identified, allowing for structure-based investigation of PK inhibition by covalent-allosteric mechanisms.

MOLECULES (2023)

Review Chemistry, Medicinal

Chemical language models for molecular design

Juergen Bajorath

Summary: This article discusses the opportunities and methods of applying chemical language models (CLMs) in drug discovery. CLMs can be developed using recurrent neural networks or transformer architectures, and attention mechanisms are used to improve predictive performance. CLMs can be used for constrained generative modeling and the prediction of chemical reactions or drug-target interactions. Since CLMs can learn mappings of different types of sequences and are applicable to any compound or target data presented in a sequential format and tokenized, they have a wide range of applications.

MOLECULAR INFORMATICS (2023)

Article Computer Science, Artificial Intelligence

Learning characteristics of graph neural networks predicting protein-ligand affinities

Andrea Mastropietro, Giuseppe Pasculli, Juergen Bajorath

Summary: In drug design, graph neural networks (GNNs) are used to predict compound potency by analyzing graph representations of protein-ligand interactions. The study reveals that GNNs are influenced by ligand memorization during learning and certain GNN architectures prioritize interaction information for predicting high affinities. While GNNs do not comprehensively account for protein-ligand interactions and physical reality, they provide a helpful balance between ligand memorization and learning of interaction patterns.

NATURE MACHINE INTELLIGENCE (2023)

No Data Available