4.6 Article

Ensemble of decision tree reveals potential miRNA-disease associations

Journal

PLOS COMPUTATIONAL BIOLOGY
Volume 15, Issue 7

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.1007209

Funding

  1. National Natural Science Foundation of China [61772531]

In recent years, a growing number of associations between microRNAs (miRNAs) and human diseases have been identified. Drawing on the accumulating biological data, many computational models have been developed to infer potential miRNA-disease associations. These models save the time and cost of experimental studies and contribute to research on the molecular mechanisms of human diseases and to the development of new drugs. In this paper, we proposed a novel computational method named Ensemble of Decision Tree based MiRNA-Disease Association prediction (EDTMDA), a computational framework that integrates ensemble learning with dimensionality reduction. For each miRNA-disease pair, the feature vector was extracted by calculating statistical measures, graph-theoretical measures, and matrix factorization results for the miRNA and the disease, respectively. Multiple base learners were then built, each yielding a decision tree (DT) trained on randomly selected negative samples and randomly selected miRNA/disease features. Within each base learner, Principal Component Analysis was applied to reduce feature dimensionality and thereby remove noise and redundancy. The outputs of these DTs were averaged to obtain the final association scores between miRNAs and diseases. In performance evaluation, EDTMDA achieved an AUC of 0.9309 in global leave-one-out cross validation (LOOCV) and an AUC of 0.8524 in local LOOCV. An AUC of 0.9192 ± 0.0009 in 5-fold cross validation further indicated that the model is reliable and stable. In addition, three types of case studies were carried out for four human diseases. As a result, 94% (Esophageal Neoplasms), 86% (Kidney Neoplasms), 96% (Breast Neoplasms) and 88% (Carcinoma Hepatocellular) of the top 50 predicted miRNAs were confirmed by experimental evidence in the literature.
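The ensemble step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes NumPy and scikit-learn, uses classification trees whose positive-class probability serves as the score, draws as many negatives as there are positives, and runs on synthetic feature vectors. The function name `edt_scores` and all hyperparameter values are hypothetical choices made only for this illustration.

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.tree import DecisionTreeClassifier


def edt_scores(X_pos, X_unlabeled, X_query, n_learners=20, feature_frac=0.5,
               n_components=5, max_depth=5, seed=0):
    """Average the scores of decision trees trained on random subsamples.

    X_pos       : features of known (positive) miRNA-disease pairs
    X_unlabeled : features of pairs with no known association
    X_query     : features of the pairs to score
    All defaults are illustrative assumptions, not values from the paper.
    """
    rng = np.random.default_rng(seed)
    n_features = X_pos.shape[1]
    n_keep = max(1, int(feature_frac * n_features))
    n_neg = min(len(X_pos), len(X_unlabeled))
    y_train = np.concatenate([np.ones(len(X_pos)), np.zeros(n_neg)])
    scores = np.zeros(len(X_query))

    for _ in range(n_learners):
        # Randomly select negative samples (assumed: balanced with positives).
        neg_idx = rng.choice(len(X_unlabeled), size=n_neg, replace=False)
        # Randomly select a subset of feature columns for this base learner.
        feat_idx = rng.choice(n_features, size=n_keep, replace=False)
        X_train = np.vstack([X_pos[:, feat_idx],
                             X_unlabeled[neg_idx][:, feat_idx]])

        # Reduce dimensionality with PCA before fitting the tree.
        pca = PCA(n_components=min(n_components, n_keep, len(X_train)))
        Z_train = pca.fit_transform(X_train)
        Z_query = pca.transform(X_query[:, feat_idx])

        tree = DecisionTreeClassifier(max_depth=max_depth,
                                      random_state=int(rng.integers(1 << 31)))
        tree.fit(Z_train, y_train)
        # Use the predicted probability of the positive class as the score.
        scores += tree.predict_proba(Z_query)[:, 1]

    # Averaging strategy: the final score is the mean over all trees.
    return scores / n_learners


# Synthetic demonstration data (not real miRNA-disease features).
demo_rng = np.random.default_rng(42)
X_pos = demo_rng.normal(loc=1.0, size=(40, 12))
X_unlabeled = demo_rng.normal(loc=-1.0, size=(200, 12))
X_query = np.vstack([demo_rng.normal(loc=1.0, size=(5, 12)),
                     demo_rng.normal(loc=-1.0, size=(5, 12))])
scores = edt_scores(X_pos, X_unlabeled, X_query)
```

In this sketch each tree sees a different random set of negatives and features, so no single arbitrary choice of negative samples dominates the averaged score. How the actual feature vectors are built (statistical measures, graph-theoretical measures, matrix factorization) is outside the scope of the sketch.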

Recommended

Review Biochemical Research Methods

Drug-pathway association prediction: from experimental results to computational models

Chun-Chun Wang, Yan Zhao, Xing Chen

Summary: Efforts are needed to develop effective drugs for complex diseases. Traditional drug discovery methods are time-consuming and costly, leading to the proposal of pathway-based drug discovery. Computational models have been established to predict drug-pathway associations, facilitating the development of new drugs.

BRIEFINGS IN BIOINFORMATICS (2021)

Review Biochemical Research Methods

Microbes and complex diseases: from experimental results to computational models

Yan Zhao, Chun-Chun Wang, Xing Chen

Summary: Research has shown that the number of microbes in the human body is almost 10 times higher than the number of cells, and they play crucial roles in immune function, digestion, and metabolism. Recent studies have revealed close relationships between noncommunicable diseases and microbes, providing new insights into disease pathogenesis. Computational models have been developed to predict disease-related microbes, potentially revolutionizing disease diagnosis, treatment, and drug development.

BRIEFINGS IN BIOINFORMATICS (2021)

Article Biochemical Research Methods

Ensemble of kernel ridge regression-based small molecule-miRNA association prediction in human disease

Chun-Chun Wang, Chi-Chi Zhu, Xing Chen

Summary: MicroRNAs (miRNAs) play important roles in human disease, and identifying SM-miRNA associations is crucial for drug development and treatment. This study proposes EKRRSMMA, a method that combines feature dimensionality reduction and ensemble learning to predict potential SM-miRNA associations. Evaluation and case studies confirm the reliability of EKRRSMMA.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemical Research Methods

Dual-Network Collaborative Matrix Factorization for predicting small molecule-miRNA associations

Shu-Hao Wang, Chun-Chun Wang, Li Huang, Lian-Ying Miao, Xing Chen

Summary: In this study, a novel method called Dual-network Collaborative Matrix Factorization (DCMF) was proposed for predicting potential SM-miRNA associations. The method utilized the Weighted K Nearest Known Neighbors (WKNKN) method to preprocess the association matrix and introduced a dual network to incorporate more diverse similarity information. The effectiveness of DCMF was evaluated through cross validations and case studies, achieving high AUC values.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemical Research Methods

Prediction of potential miRNA-disease associations based on stacked autoencoder

Chun-Chun Wang, Tian-Hao Li, Li Huang, Xing Chen

Summary: In recent years, miRNA has been shown to play an important role in the development of human complex diseases. This study introduces a computational model called SAEMDA, which utilizes computational methods based on biological data to discover miRNA-disease associations. SAEMDA is able to make full use of the feature information of all unlabeled miRNA-disease pairs and is suitable for datasets with small labeled samples and large unlabeled samples. Experimental results show that SAEMDA outperforms previous models in terms of predictive accuracy.

BRIEFINGS IN BIOINFORMATICS (2022)

Editorial Material Biochemical Research Methods

Computational model for ncRNA research

Xing Chen, Li Huang

BRIEFINGS IN BIOINFORMATICS (2022)

Review Biochemical Research Methods

Updated review of advances in microRNAs and complex diseases: experimental results, databases, webservers and data fusion

Li Huang, Li Zhang, Xing Chen

Summary: MicroRNAs (miRNAs) are important gene regulators in the pathogenesis of complex diseases and have potential applications in diagnosis and therapy. Accurate discovery of miRNA-disease associations (MDAs) is crucial for effective miRNA therapy. This review revisits miRNA biogenesis, detection techniques, and functions, summarizes recent experimental findings related to common miRNA-associated diseases, introduces updates of relevant databases and web servers, and discusses the contribution of diverse data sources to accurate MDA prediction.

BRIEFINGS IN BIOINFORMATICS (2022)

Review Biochemical Research Methods

Updated review of advances in microRNAs and complex diseases: towards systematic evaluation of computational models

Li Huang, Li Zhang, Xing Chen

Summary: There is currently no widely accepted strategy for evaluating computational models for microRNA-disease associations (MDAs). The evaluation methods and procedures are often determined on a case-by-case basis and depend on the choices of researchers. This review provides a comprehensive analysis of the evaluation methods used for 29 state-of-the-art models predicting MDAs and recommends a feasible evaluation workflow for future models.

BRIEFINGS IN BIOINFORMATICS (2022)

Review Biochemical Research Methods

Updated review of advances in microRNAs and complex diseases: taxonomy, trends and challenges of computational models

Li Huang, Li Zhang, Xing Chen

Summary: In this review, 29 state-of-the-art models for microRNA-disease association (MDA) prediction based on model fusion and non-fusion are presented. The new taxonomy demonstrates changes in the algorithmic architecture of models compared to earlier classifications. Furthermore, the progress made in overcoming obstacles to effective MDA prediction since 2017 is discussed, and future research directions are proposed for enhancing model performance.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemical Research Methods

Predicting drug-target binding affinity through molecule representation block based on multi-head attention and skip connection

Li Zhang, Chun-Chun Wang, Xing Chen

Summary: This study presents a novel model called MRBDTA to improve the existing computational models for drug-target binding affinity prediction. MRBDTA achieves better performance in prediction accuracy and can provide interpretability analysis. The case studies demonstrate the reliable performance of MRBDTA in drug design for SARS-CoV-2.

BRIEFINGS IN BIOINFORMATICS (2022)

Editorial Material Biochemical Research Methods

Computational model for disease research

Xing Chen, Li Huang

BRIEFINGS IN BIOINFORMATICS (2023)

Article Biochemical Research Methods

SNRMPACDC: computational model focused on Siamese network and random matrix projection for anticancer synergistic drug combination prediction

Tian-Hao Li, Chun-Chun Wang, Li Zhang, Xing Chen

Summary: Synergistic drug combinations can improve therapeutic effect and reduce toxicity. Computational methods are efficient tools for predicting potential synergistic drug combinations. We developed a new model called SNRMPACDC, which achieved better results in predicting anticancer synergistic drug combinations.

BRIEFINGS IN BIOINFORMATICS (2022)

Article Biochemical Research Methods

MCFF-MTDDI: multi-channel feature fusion for multi-typed drug-drug interaction prediction

Chen-Di Han, Chun-Chun Wang, Li Huang, Xing Chen

Summary: Adverse drug-drug interactions (DDIs) have become a serious problem in healthcare. Researchers have proposed a Multi-Channel Feature Fusion model for multi-typed DDI prediction (MCFF-MTDDI), which effectively fuses different features extracted from drug chemical structure, drug pairs' extra label, and drug knowledge graph (KG) to predict multi-typed DDIs. The results of experiments on multiple datasets demonstrate the effectiveness of MCFF-MTDDI.

BRIEFINGS IN BIOINFORMATICS (2023)

Article Biology

Deciphering ligand-receptor-mediated intercellular communication based on ensemble deep learning and the joint scoring strategy from single-cell transcriptomic data

Lihong Peng, Jingwei Tan, Wei Xiong, Li Zhang, Zhao Wang, Ruya Yuan, Zejun Li, Xing Chen

Summary: The study introduces a new deep learning framework, CellComNet, which deciphers cell-cell communication mediated by extracellular molecules through the analysis of single-cell transcriptomic data. The framework demonstrates efficient identification of credible LRIs and significantly improves the inference performance of cell-cell communication. It has the potential to contribute to anticancer drug design and tumor-targeted therapy.

COMPUTERS IN BIOLOGY AND MEDICINE (2023)

Article Biochemical Research Methods

CellEnBoost: A Boosting-Based Ligand-Receptor Interaction Identification Model for Cell-to-Cell Communication Inference

Lihong Peng, Ruya Yuan, Chendi Han, Guosheng Han, Jingwei Tan, Zhao Wang, Min Chen, Xing Chen

Summary: Cell-to-cell communication (CCC) plays significant roles in multicellular organisms, especially in cancer genesis, development, and metastasis. This manuscript presents a Boosting-based LRI identification model (CellEnBoost) for predicting and interpreting ligand-receptor interactions in CCC. Experimental results demonstrate the superior performance of this model and its validation in human head and neck squamous cell carcinoma tissues.

IEEE TRANSACTIONS ON NANOBIOSCIENCE (2023)
