Journal
PROTEIN JOURNAL
Volume 29, Issue 1, Pages 62-67
Publisher
SPRINGER
DOI: 10.1007/s10930-009-9222-z
Keywords
Support vector machine; Auto covariance; Protein structural class; Pseudo-amino acid composition; Ensemble classifier
Funding
- National Natural Science Foundation of China [20775052]
This article identifies protein structural classes using a support vector machine (SVM) ensemble classifier, an approach that is effective at enhancing prediction performance. First, auto covariance (AC) and pseudo-amino acid composition (PseAAC) were used to represent proteins: AC focuses on adjacent effects, and PseAAC takes sequence-order patterns into account. Second, SVMs were trained on the datasets represented by the different descriptors. Finally, an ensemble classifier, constructed from the individual classifiers through a voting strategy, gave the final prediction results. A very promising prediction accuracy of 93.14% was obtained by the jackknife test. The experimental results showed that the ensemble system can greatly improve prediction performance and generate more stable and safer predictors. The current method, which fuses the protein primary sequence information captured by AC with that described by PseAAC, may play an important complementary role in other related applications.
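As a rough illustration of two ideas named in the abstract, the sketch below computes an auto covariance descriptor for one physicochemical property and combines class labels by majority vote. The abstract does not give the formula, the property scales, or the tie-breaking rule, so everything here is an assumption: the commonly used definition AC(lag) = 1/(n - lag) * sum of (x_i - mean)(x_{i+lag} - mean), the toy `TOY_SCALE` values, the function names `auto_covariance` and `majority_vote`, and the class labels in the usage note. It is not the authors' implementation.

```python
from collections import Counter

# Hypothetical two-residue property scale, chosen only to make the
# arithmetic easy to follow. These are not values from the paper.
TOY_SCALE = {"A": 1.0, "K": -1.0}


def auto_covariance(sequence, scale, max_lag):
    """Return AC values for lags 1..max_lag for a single property scale.

    Assumed definition: the mean-centered property values of residues
    that are `lag` positions apart are multiplied and averaged.
    """
    values = [scale[residue] for residue in sequence]
    n = len(values)
    if max_lag >= n:
        raise ValueError("max_lag must be smaller than the sequence length")
    mean = sum(values) / n
    centered = [v - mean for v in values]
    return [
        sum(centered[i] * centered[i + lag] for i in range(n - lag)) / (n - lag)
        for lag in range(1, max_lag + 1)
    ]


def majority_vote(predictions):
    """Return the most frequent label among the individual classifiers.

    Assumption: ties go to the label that appears first in the list.
    """
    return Counter(predictions).most_common(1)[0][0]
```

For example, `auto_covariance("AKAK", TOY_SCALE, 2)` yields one value per lag, and `majority_vote(["all-alpha", "all-beta", "all-alpha"])` picks the label that two of the three hypothetical classifiers agree on. In a full pipeline, each descriptor set would feed its own SVM, and the vote would be taken over those SVMs' outputs.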