Article

Malsite-Deep: Prediction of protein malonylation sites through deep learning and multi-information fusion based on NearMiss-2 strategy

Journal

KNOWLEDGE-BASED SYSTEMS
Volume 240

Publisher

ELSEVIER
DOI: 10.1016/j.knosys.2022.108191

Keywords

Malonylation; Multi-information fusion; NearMiss-2; Gated recurrent units; Deep neural networks

Funding

  1. National Natural Science Foundation of China [62172248]
  2. Natural Science Foundation of Shandong Province of China [ZR2021MF098]

In this paper, a new prediction model, Malsite-Deep, is proposed for predicting protein malonylation sites. The model combines feature extraction and deep neural networks to achieve accurate predictions, and its performance is evaluated on multiple test sets.
Malonylation is a new protein post-translational modification that regulates a variety of cellular physiological processes. However, identifying malonylation sites through traditional experiments is costly and time-consuming, so computational prediction of malonylation sites plays an important role in experimental design. In this paper, a new prediction model for malonylation sites, Malsite-Deep, is proposed. First, seven feature extraction methods are used to extract feature information from protein sequences. Then, the NearMiss-2 under-sampling method is applied to handle imbalanced data, and the update gate and reset gate of gated recurrent units (GRU) are used to select the optimal feature subset. Finally, the output of the GRU layer is fed into deep neural networks (DNN) to predict the malonylation sites, and model performance is evaluated by 10-fold cross-validation and on independent test sets. Under 10-fold cross-validation, the AUC on the training dataset reaches 0.99, and the AUC values on the four independent test datasets all exceed 0.95. These results suggest that Malsite-Deep facilitates the identification of protein malonylation sites. © 2022 Elsevier B.V. All rights reserved.
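
The abstract names NearMiss-2 as the under-sampling step but does not describe it. As a rough illustration only, the general NearMiss-2 heuristic keeps the majority-class samples whose mean distance to their k farthest minority-class samples is smallest. The sketch below is not the authors' implementation: the function name `near_miss_2`, its parameters, the use of Euclidean distance, and the toy 2-D points are all assumptions made for this example.

```python
import math

def near_miss_2(majority, minority, n_keep, k=3):
    """Illustrative NearMiss-2 (hypothetical helper, not the paper's code).

    Keeps the n_keep majority samples whose mean Euclidean distance to
    their k farthest minority samples is smallest.
    """
    k = min(k, len(minority))
    scored = []
    for idx, x in enumerate(majority):
        # Distances from this majority sample to every minority sample,
        # sorted so the farthest minority samples come first.
        dists = sorted((math.dist(x, m) for m in minority), reverse=True)
        scored.append((sum(dists[:k]) / k, idx))
    scored.sort()  # smallest mean distance first
    keep = sorted(idx for _, idx in scored[:n_keep])  # restore input order
    return [majority[i] for i in keep]

# Toy data (assumed for illustration): 3 minority and 5 majority points.
minority = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
majority = [(0.5, 0.5), (5.0, 5.0), (1.0, 1.0), (-4.0, -4.0), (10.0, 0.0)]

# Balance the classes by keeping as many majority samples as minority ones.
kept = near_miss_2(majority, minority, n_keep=len(minority), k=2)
```

In this toy case the two distant points (5, 5) and (10, 0) are discarded, leaving a balanced set of three majority and three minority samples. A real pipeline would apply the same idea to the extracted sequence feature vectors, most likely through an existing library implementation rather than hand-written code.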
