
iBCE-EL: A New Ensemble Learning Framework for Improved Linear B-Cell Epitope Prediction

FRONTIERS IN IMMUNOLOGY (2018)

Journal

FRONTIERS IN IMMUNOLOGY

Volume 9, Issue -, Pages -

Publisher

FRONTIERS MEDIA SA

DOI: 10.3389/fimmu.2018.01695

Keywords

B-cell epitope; ensemble learning; extremely randomized tree; gradient boosting; immunotherapy

Category

Immunology

Funding

Basic Science Research Program through the National Research Foundation (NRF) of Korea - Ministry of Education, Science, and Technology [2018R1D1A1B07049572, 2009-0093826]
Brain Research Program through the National Research Foundation of Korea (NRF) - Ministry of Science, ICT, and Future Planning [2016M3C7A1904392]


Abstract

Identification of B-cell epitopes (BCEs) is a fundamental step for epitope-based vaccine development, antibody production, and disease prevention and diagnosis. Due to the avalanche of protein sequence data discovered in the postgenomic age, it is essential to develop an automated computational method that enables fast and accurate identification of novel BCEs within the vast number of candidate proteins and peptides. Although several computational methods have been developed, their accuracy is unreliable. Thus, developing a reliable model with significant prediction improvements is highly desirable. In this study, we first constructed a non-redundant data set of 5,550 experimentally validated BCEs and 6,893 non-BCEs from the Immune Epitope Database. We then developed a novel ensemble learning framework for improved linear BCE prediction, called iBCE-EL. It is a fusion of two independent predictors, namely, extremely randomized tree (ERT) and gradient boosting (GB) classifiers, which, respectively, use a combination of physicochemical properties (PCP) and amino acid composition and a combination of dipeptide and PCP as input features. Cross-validation analysis on a benchmarking data set showed that iBCE-EL performed better than the individual classifiers (ERT and GB), with a Matthews correlation coefficient (MCC) of 0.454. Furthermore, we evaluated the performance of iBCE-EL on an independent data set. The results show that iBCE-EL significantly outperformed the state-of-the-art method, achieving an MCC of 0.463. To the best of our knowledge, iBCE-EL is the first ensemble method for linear BCE prediction. iBCE-EL was implemented in a web-based platform, which is available at http://thegleelab.org/iBCE-EL. iBCE-EL offers two prediction modes. The first classifies peptide sequences as BCEs or non-BCEs, while the second gives users the option of mining potential BCEs from protein sequences.
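The sequence features and the evaluation metric named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names are hypothetical, the physicochemical property (PCP) features are omitted, and the weighted average used to fuse the two classifier outputs is an assumption, since the abstract does not state how the ERT and GB predictions are combined.

```python
from itertools import product
from math import sqrt

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def amino_acid_composition(peptide):
    """Return 20 features: the fraction of each standard residue."""
    peptide = peptide.upper()
    if not peptide:
        raise ValueError("peptide must contain at least one residue")
    # Non-standard letters count toward the length but get no feature.
    return [peptide.count(aa) / len(peptide) for aa in AMINO_ACIDS]


def dipeptide_composition(peptide):
    """Return 400 features: the fraction of each ordered residue pair."""
    peptide = peptide.upper()
    if len(peptide) < 2:
        raise ValueError("peptide must contain at least two residues")
    # Overlapping pairs, e.g. "AAC" -> ["AA", "AC"].
    pairs = [peptide[i:i + 2] for i in range(len(peptide) - 1)]
    return [pairs.count(a + b) / len(pairs)
            for a, b in product(AMINO_ACIDS, repeat=2)]


def fuse_probabilities(p_ert, p_gb, weight=0.5):
    """Assumed fusion rule: weighted average of two BCE probabilities."""
    return weight * p_ert + (1.0 - weight) * p_gb


def matthews_corrcoef(tp, tn, fp, fn):
    """MCC from confusion-matrix counts; defined as 0.0 when undefined."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom
```

In this sketch, the composition vectors would be concatenated with PCP descriptors and passed to the two classifiers, and `fuse_probabilities` would turn their outputs into a single score that is thresholded to label a peptide as BCE or non-BCE.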

