4.7 Article

NMFCDA: Combining randomization-based neural network with non-negative matrix factorization for predicting CircRNA-disease association

Journal

APPLIED SOFT COMPUTING
Volume 110, Issue -, Pages -

Publisher

ELSEVIER
DOI: 10.1016/j.asoc.2021.107629

Keywords

CircRNA-disease association; Randomization-based neural network; Non-negative matrix factorization; Gaussian interaction profile; Natural language processing

Funding

  1. National Natural Science Foundation of China [61702444]
  2. West Light Foundation of The Chinese Academy of Sciences [2018-XBQNXZ-B-008]
  3. Chinese Postdoctoral Science Foundation [2019M653804]
  4. Tianshan youth-Excellent Youth, China [2019Q029]
  5. Qingtan scholar talent project of Za-ozhuang University, China

Ask authors/readers for more resources

Recent studies show the close relationship between circRNA and human diseases. A novel computational framework NMFCDA is proposed for predicting circRNA-disease associations, achieving high prediction accuracy and AUC in benchmark data set and providing theoretical basis and reliable circRNA candidates for biological experiments.
Recent studies suggest that circRNA is closely related to the occurrence and development of human diseases, and it has great application prospects in the field of disease diagnostic markers. However, restricted by the environment and conditions, it is usually time-consuming and labor-intensive to use biological experimental methods to identify the association between circRNA and disease. In this study, we propose a novel computational framework NMFCDA that combines randomization based neural network Pseudoinverse Learning (PIL) with Non-Negative Matrix Factorization (NMF) to predict circRNA-disease associations. The model first fuses circRNA natural language sequence information, disease semantic information, and circRNA and disease Gaussian interaction profile (GIP) kernel similarity information into a unified matrix, then uses NMF algorithm to obtain its key features, and finally uses randomization-based PIL to search for the global optimal solution to accurately predict the association between circRNA and disease. In the benchmark data set circR2Disease, NMFCDA achieved a prediction accuracy of 92.56% and an AUC of 0.9278, significantly higher than other classifier models and previous existing methods. Furthermore, 26 of the top 30 disease-associated circRNAs with the highest predictive scores were confirmed by the relevant literature. These results indicate that NMFCDA can be used as a useful prediction tool to provide theoretical basis and reliable circRNA candidates for biological experiments. (C) 2021 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available