4.7 Article

DTI-CDF: a cascade deep forest model towards the prediction of drug-target interactions based on hybrid features

期刊

BRIEFINGS IN BIOINFORMATICS
卷 22, 期 1, 页码 451-462

出版社

OXFORD UNIV PRESS
DOI: 10.1093/bib/bbz152

关键词

drug-target interaction; machine learning; ensemble learning; cascade deep forest

资金

  1. National Key Research and Development Program of China [2016YFA0501703]
  2. National Natural Science Foundation of China [31601074, 61872094, 61832019]

向作者/读者索取更多资源

Drug-target interactions are crucial in drug discovery, but existing prediction methods suffer from low precision and high false-positive rates. The proposed DTI-CDF model significantly outperforms traditional methods and accurately predicts new DTIs.
Drug-target interactions (DTIs) play a crucial role in target-based drug discovery and development. Computational prediction of DTIs can effectively complement experimental wet-lab techniques for the identification of DTIs, which are typically time- and resource-consuming. However, the performances of the current DTI prediction approaches suffer from a problem of low precision and high false-positive rate. In this study, we aim to develop a novel DTI prediction method for improving the prediction performance based on a cascade deep forest (CDF) model, named DTI-CDF, with multiple similarity-based features between drugs and the similarity-based features between target proteins extracted from the heterogeneous graph, which contains known DTIs. In the experiments, we built five replicates of 10-fold cross-validation under three different experimental settings of data sets, namely, corresponding DTI values of certain drugs (SD), targets (ST), or drug-target pairs (SP) in the training sets are missed but existed in the test sets. The experimental results demonstrate that our proposed approach DTI-CDF achieves a significantly higher performance than that of the traditional ensemble learning-based methods such as random forest and XGBoost, deep neural network, and the state-of-the-art methods such as DDR. Furthermore, there are 1352 newly predicted DTIs which are proved to be correct by KEGG and DrugBank databases.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Biochemical Research Methods

Irinotecan and vandetanib create synergies for treatment of pancreatic cancer patients with concomitant TP53 and KRAS mutations

Aman Chandra Kaushik, Yan-Jing Wang, Xiangeng Wang, Dong-Qing Wei

Summary: The study revealed significant differences in the genomics, transcriptomics, methylomics, and molecular dynamics between KRAS and TP53 mutations from the wild type in PAAD, with the prognosis of pancreatic cancer directly linked to specific KRAS mutations and protein stability. Screened drugs show potential effectiveness for PAAD patients.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemical Research Methods

scHiCStackL: a stacking ensemble learning-based method for single-cell Hi-C classification using cell embedding

Hao Wu, Yingfu Wu, Yuhong Jiang, Bing Zhou, Haoru Zhou, Zhongli Chen, Yi Xiong, Quanzhong Liu, Hongming Zhang

Summary: This study proposes a high accuracy cell classification algorithm, scHiCStackL, based on single-cell Hi-C data. The algorithm improves the data preprocessing method and constructs a two-layer stacking ensemble model for classifying cells. Experimental results show that scHiCStackL achieves superior performance in predicting cell types using single-cell Hi-C data.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemical Research Methods

MDF-SA-DDI: predicting drug-drug interaction events based on multi-source drug fusion, multi-source feature fusion and transformer self-attention mechanism

Shenggeng Lin, Yanjing Wang, Lingfeng Zhang, Yanyi Chu, Yatong Liu, Yitian Fang, Mingming Jiang, Qiankun Wang, Bowen Zhao, Yi Xiong, Dong-Qing Wei

Summary: One of the main problems with the joint use of multiple drugs is the potential for adverse drug interactions and side effects. This study proposes a novel method, MDF-SA-DDI, which predicts drug-drug interaction events using multi-source drug fusion, multi-source feature fusion, and transformer self-attention mechanism.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemical Research Methods

T4SEfinder: a bioinformatics tool for genome-scale prediction of bacterial type IV secreted effectors using pre-trained protein language model

Yumeng Zhang, Yangming Zhang, Yi Xiong, Hui Wang, Zixin Deng, Jiangning Song, Hong-Yu Ou

Summary: Bacterial type IV secretion systems (T4SSs) play crucial roles in bacterial pathogenesis, with versatile functions. Using a sequence embedding strategy from a pre-trained language model can enhance the accuracy and speed of T4SE prediction tools.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemical Research Methods

De novo molecular design with deep molecular generative models for PPI inhibitors

Jianmin Wang, Yanyi Chu, Jiashun Mao, Hyeon-Nae Jeon, Haiyan Jin, Amir Zeb, Yuil Jang, Kwang-Hwi Cho, Tao Song, Kyoung Tai No

Summary: This study constructs a dataset for protein-protein interaction (PPI) targeted drug-likeness and proposes a deep molecular generative framework to generate novel drug-like molecules based on the features of seed compounds. The results show that the generated molecules have better PPI-targeted drug-likeness and drug-likeness, and the model performs comparably to other state-of-the-art molecule generation models.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biology

DeepPSE: Prediction of polypharmacy side effects by fusing deep representation of drug pairs and attention mechanism

Shenggeng Lin, Guangwei Zhang, Dong-Qing Wei, Yi Xiong

Summary: Polypharmacy is an effective strategy for treating complex or co-existing diseases, but it can lead to higher risk of adverse side effects due to drug interactions. This study proposes a deep learning-based method called DeepPSE to predict polypharmacy side effects. Experimental results demonstrate that DeepPSE outperforms five baseline or state-of-the-art methods in predicting drug pair side effects.

COMPUTERS IN BIOLOGY AND MEDICINE (2022)

Article Biochemical Research Methods

Subtype-DCC: decoupled contrastive clustering method for cancer subtype identification based on multi-omics data

Jing Zhao, Bowen Zhao, Xiaotong Song, Chujun Lyu, Weizhi Chen, Yi Xiong, Dong-Qing Wei

Summary: The Subtype-DCC method, which integrates multi-omics data, is proposed for cancer subtyping and demonstrates superior performance compared to existing clustering methods. It has potential applications in cancer diagnosis, prognosis, and treatment.

BRIEFINGS IN BIOINFORMATICS (2023)

Article Biochemical Research Methods

SAM-DTA: a sequence -agnostic model for drug-target binding affinity prediction

Zhiqiang Hu, Wenfeng Liu, Chenbin Zhang, Jiawen Huang, Shaoting Zhang, Huiqun Yu, Yi Xiong, Hao Liu, Song Ke, Liang Hong

Summary: Drug-target binding affinity prediction is a crucial task in drug discovery. Surprisingly, this study shows that a model can achieve superior performance without any protein sequence information. Instead, the model characterizes proteins based on the ligands they interact with. The proposed paradigm outperforms sequence-based methods in terms of Mean Squared Error (MSE) and R-Square.

BRIEFINGS IN BIOINFORMATICS (2023)

Article Chemistry, Medicinal

LS-MolGen: Ligand-and-Structure Dual-Driven Deep Reinforcement Learning for Target-Specific Molecular Generation Improves Binding Affinity and Novelty

Song Li, Chao Hu, Song Ke, Chenxing Yang, Jun Chen, Yi Xiong, Hao Liu, Liang Hong

Summary: This article presents a novel ligand and structure integrated molecular generative model called LS-MolGen, which combines representation learning, transfer learning, and reinforcement learning. The model demonstrates superior performance in generating promising compounds with novel scaffolds and high binding affinity for drug design.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Genetics & Heredity

NetGO 3.0: Protein Language Model Improves Large-scale Functional Annotations

Shaojun Wang, Ronghui You, Yunjia Liu, Yi Xiong, Shanfeng Zhu

Summary: NetGO 2.0 is an automated function prediction method that integrates multiple sources of information to improve performance. However, it does not leverage valuable information from a large number of unannotated proteins. Recently, protein language models have been proposed to learn informative representations from protein sequences. By incorporating this model into NetGO 2.0, researchers have developed NetGO 3.0, which significantly improves the performance of automated function prediction.

GENOMICS PROTEOMICS & BIOINFORMATICS (2023)

Article Chemistry, Medicinal

A Multimodal Deep Learning Framework for Predicting PPI-Modulator Interactions

Heqi Sun, Jianmin Wang, Hongyan Wu, Shenggeng Lin, Junwei Chen, Jinghua Wei, Shuai Lv, Yi Xiong, Dong-Qing Wei

Summary: This study proposes a deep learning framework called MultiPPIMI for predicting the interaction between PPI targets and modulators. Experimental results show that MultiPPIMI performs well in predicting both cold-start scenarios and random-split scenarios, and can assist in screening inhibitors for Keap1/Nrf2 PPI interactions.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Biochemical Research Methods

MATT-DDI: Predicting multi-type drug-drug interactions via heterogeneous attention mechanisms

Shenggeng Lin, Xueying Mao, Liang Hong, Shuangjun Lin, Dong-Qing Wei, Yi Xiong

Summary: This article introduces a novel method, MATT-DDI, for predicting multi-type drug-drug interactions (DDIs) by accurately considering the original feature vectors of drugs and multiple attention mechanisms. Experimental results show that MATT-DDI has good performance and no information leakage.

METHODS (2023)

Article Biochemical Research Methods

SAM-DTA: a sequence-agnostic model for drug-target binding affinity prediction

Zhiqiang Hu, Wenfeng Liu, Chenbin Zhang, Jiawen Huang, Shaoting Zhang, Huiqun Yu, Yi Xiong, Hao Liu, Song Ke, Liang Hong

Summary: This study demonstrates that drug-target binding affinity prediction can achieve superior performance without accessing protein-sequence-related information. Instead, the study characterizes a protein solely based on the ligands it interacts with. By separately treating different proteins and training them jointly, a robust and universal representation of ligands is learned, which outperforms sequence-based methods. The results highlight the model's robustness in generalizing to unseen proteins and predicting future data.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Computer Science, Artificial Intelligence

A transformer-based model to predict peptide-HLA class I binding and optimize mutated peptides for vaccine design

Yanyi Chu, Yan Zhang, Qiankun Wang, Lingfeng Zhang, Xuhong Wang, Yanjing Wang, Dennis Russell Salahub, Qin Xu, Jianmin Wang, Xue Jiang, Yi Xiong, Dong-Qing Wei

Summary: The study presents the TransMut framework, which automatically optimizes mutated peptides with higher affinity to the target HLA allele. This framework provides an automated approach for screening immunogenic peptides and vaccine design.

NATURE MACHINE INTELLIGENCE (2022)

暂无数据