Article

Improving the generalizability of protein-ligand binding predictions with AI-Bind

NATURE COMMUNICATIONS (2023)

Journal

NATURE COMMUNICATIONS

Volume 14, Issue 1

Publisher

NATURE PORTFOLIO

DOI: 10.1038/s41467-023-37572-z

Automated Summary

Identifying novel drug-target interactions is a critical challenge in drug discovery, and current deep learning models struggle to generalize to new structures. This study introduces AI-Bind, a pipeline that combines network-based sampling strategies and unsupervised pre-training to improve binding predictions for new proteins and ligands. Validation through docking simulations and comparison with experimental evidence demonstrates the potential of AI-Bind in predicting protein-ligand binding and identifying active binding sites.

Abstract

Identifying novel drug-target interactions is a critical and rate-limiting step in drug discovery. While deep learning models have been proposed to accelerate the identification process, here we show that state-of-the-art models fail to generalize to novel (i.e., never-before-seen) structures. We unveil the mechanisms responsible for this shortcoming, demonstrating how models rely on shortcuts that leverage the topology of the protein-ligand bipartite network, rather than learning the node features. We introduce AI-Bind, a pipeline that combines network-based sampling strategies with unsupervised pre-training to improve binding predictions for novel proteins and ligands. We validate AI-Bind predictions via docking simulations and comparison with recent experimental evidence, and advance the interpretation of machine learning predictions of protein-ligand binding by identifying potential active binding sites on the amino acid sequence. AI-Bind is a high-throughput approach to identify drug-target combinations with the potential of becoming a powerful tool in drug discovery.

State-of-the-art machine learning models in drug discovery fail to reliably predict the binding properties of poorly annotated proteins and small molecules. Here, the authors present AI-Bind, a machine learning pipeline to improve generalizability and interpretability of binding predictions.
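The topological shortcut mentioned in the abstract can be illustrated with a toy baseline that ignores molecular features entirely and scores a protein-ligand pair only from how often each node appears with a positive (binding) label in the training annotations. This is a minimal sketch under stated assumptions, not the authors' code: the function names `degree_ratios` and `shortcut_score`, the averaging rule, the fallback value of 0.5, and the toy data are all hypothetical.

```python
from collections import defaultdict


def degree_ratios(train_pairs):
    """Per-node fraction of positive annotations in the training set.

    train_pairs: iterable of (protein_id, ligand_id, label), label in {0, 1}.
    Nodes are keyed as ("P", id) or ("L", id) so protein and ligand ids
    cannot collide.
    """
    positives = defaultdict(int)
    totals = defaultdict(int)
    for protein, ligand, label in train_pairs:
        for node in (("P", protein), ("L", ligand)):
            totals[node] += 1
            positives[node] += label
    return {node: positives[node] / totals[node] for node in totals}


def shortcut_score(protein, ligand, ratios, default=0.5):
    """Feature-free score: average of the two nodes' positive ratios.

    Nodes never seen in training fall back to `default` (an assumed,
    uninformative value), which is why a purely topological predictor
    has nothing to say about novel proteins or ligands.
    """
    ratio_protein = ratios.get(("P", protein), default)
    ratio_ligand = ratios.get(("L", ligand), default)
    return (ratio_protein + ratio_ligand) / 2


# Hypothetical toy annotations: (protein, ligand, binds?)
train = [
    ("P1", "L1", 1), ("P1", "L2", 1), ("P1", "L3", 1),
    ("P2", "L1", 0), ("P2", "L2", 0),
    ("P3", "L3", 1),
]
ratios = degree_ratios(train)

print(shortcut_score("P1", "L3", ratios))        # both nodes seen only with positive labels
print(shortcut_score("P2", "L1", ratios))        # P2 seen only with negative labels
print(shortcut_score("P_new", "L_new", ratios))  # unseen nodes: falls back to the default
```

A model whose test performance is matched by a baseline of this kind is likely exploiting annotation imbalance in the bipartite network instead of learning from the node features, which is the failure mode the abstract describes for novel structures.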
