4.5 Article

FTWSVM-SR: DNA-Binding Proteins Identification via Fuzzy Twin Support Vector Machines on Self-Representation

Publisher

SPRINGER HEIDELBERG
DOI: 10.1007/s12539-021-00489-6

Keywords

DNA-binding proteins; Fuzzy membership; Self-representation; Fuzzy twin support vector machine; Multiple kernel fusion

Funding

  1. National Science Foundation of China [NSFC 61873112, 61922020, 62172076, 61902271]
  2. Special Science Foundation of Quzhou [2021D004]

This study combines a fuzzy twin support vector machine (FTWSVM) with a multiple kernel learning (MKL) algorithm and a self-representation-based membership function to detect DNA-binding proteins (DBPs), and reports the best Matthews correlation coefficient among the compared models on two independent test sets.
Because detecting DNA-binding proteins (DBPs) experimentally is costly, many machine learning (ML) algorithms have been used to process and detect DBPs at large scale. Previous methods did not account for noisy samples. In this study, a fuzzy twin support vector machine (FTWSVM) is employed to detect DBPs. First, multiple types of protein sequence features are converted into kernel matrices; then, a multiple kernel learning (MKL) algorithm linearly combines the kernels; next, a self-representation-based membership function estimates the membership value (weight) of each training sample; finally, the integrated kernel matrix and the membership values are fed into the FTWSVM-SR model for training and testing. Compared with other predictive models, FTWSVM based on SR (FTWSVM-SR) obtains the best Matthews correlation coefficient (MCC): 0.7410 and 0.5909 on two independent test sets (the PDB186 and PDB2272 datasets), respectively. The results indicate that the method can be an effective DBP detection tool: the model can screen and analyze DBPs at large scale before biochemical experiments are carried out.
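
The first three steps of the pipeline described above, together with the MCC metric, can be illustrated with a short NumPy sketch. It is not the authors' implementation. The RBF kernel, the fixed fusion weights, the ridge-regularized per-class self-representation, and the exponential mapping from reconstruction error to membership are all assumptions chosen for illustration, and every function name is hypothetical. The abstract does not specify the actual MKL weighting scheme or membership formula, and the FTWSVM solver itself is omitted.

```python
import numpy as np

def rbf_kernel(X, gamma):
    """Gaussian (RBF) kernel matrix over the rows of X (assumed kernel type)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    return np.exp(-gamma * d2)

def fuse_kernels(kernels, weights):
    """Linear combination of kernel matrices; weights are normalized to sum to 1."""
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    return sum(wi * K for wi, K in zip(w, kernels))

def self_representation_membership(K, y, lam=1.0):
    """Hypothetical membership: within each class, reconstruct every sample
    from its classmates in kernel space (ridge-regularized), then map the
    reconstruction error to a weight in (0, 1]."""
    y = np.asarray(y)
    m = np.empty(len(y), dtype=float)
    for label in np.unique(y):
        idx = np.where(y == label)[0]
        Kc = K[np.ix_(idx, idx)]
        n = len(idx)
        # Z minimizes ||Phi - Phi Z||^2 + lam * ||Z||^2, i.e. Z = (Kc + lam I)^-1 Kc
        Z = np.linalg.solve(Kc + lam * np.eye(n), Kc)
        R = np.eye(n) - Z
        # Per-sample error: diagonal of R^T Kc R
        err = np.clip(np.einsum("ij,ij->j", R, Kc @ R), 0.0, None)
        m[idx] = np.exp(-err / (err.mean() + 1e-12))
    return m

def mcc(y_true, y_pred):
    """Matthews correlation coefficient for labels in {+1, -1}."""
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    tn = int(np.sum((y_true == -1) & (y_pred == -1)))
    fp = int(np.sum((y_true == -1) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == -1)))
    denom = np.sqrt(float((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn)))
    return (tp * tn - fp * fn) / denom if denom > 0 else 0.0

# Toy data standing in for two protein sequence feature types (random, not real).
rng = np.random.default_rng(0)
X_a = rng.normal(size=(20, 5))
X_b = rng.normal(size=(20, 8))
y = np.array([1] * 10 + [-1] * 10)

K = fuse_kernels([rbf_kernel(X_a, 0.1), rbf_kernel(X_b, 0.1)], [0.5, 0.5])
membership = self_representation_membership(K, y)
```

In this sketch, samples that their classmates reconstruct poorly receive smaller weights, which is the general way a fuzzy SVM reduces the influence of likely noise samples during training. The fused matrix `K` and the vector `membership` correspond to the two inputs the abstract says are fed to the FTWSVM-SR model.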
