☆ 4.7 Article

StackIL6: a stacking ensemble model for improving the prediction of IL-6 inducing peptides

BRIEFINGS IN BIOINFORMATICS (2021)

期刊

BRIEFINGS IN BIOINFORMATICS

卷 22, 期 6, 页码 -

出版社

OXFORD UNIV PRESS

DOI: 10.1093/bib/bbab172

关键词

interleukin 6; IL-6; bioinformatics; sequence analysis; machine learning; ensemble learning

类别

Biochemical Research Methods Mathematical & Computational Biology

资金

Chiang Mai University, Mahidol University [MRG6180226]
TRF Research Career Development Grant [RSA6280075]
National Research Foundation of Korea (NRF) - Korean government (MSIT) [2021R1A2C1014338, 2018R1D1A1B07049572]
National Research Foundation of Korea [2018R1D1A1B07049572, 2021R1A2C1014338] Funding Source: Korea Institute of Science & Technology Information (KISTI), National Science & Technology Information Service (NTIS)

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

IL-6 release is stimulated by antigenic peptides and immune cells, making IL-6 inducing peptides useful not only as diagnostic biomarkers, but also as inhibitors for aggressive immune responses. A novel stacking ensemble model, StackIL6, was developed using twelve feature descriptors and five machine learning algorithms, showing better performance than existing methods in identifying IL-6 inducing peptides. Accessible through a web server, StackIL6 has the potential to aid in the rapid screening of promising peptides for diagnostic and immunotherapeutic applications.

The release of interleukin (IL)-6 is stimulated by antigenic peptides from pathogens as well as by immune cells for activating aggressive inflammation. IL-6 inducing peptides are derived from pathogens and can be used as diagnostic biomarkers for predicting various stages of disease severity as well as being used as IL-6 inhibitors for the suppression of aggressive multi-signaling immune responses. Thus, the accurate identification of IL-6 inducing peptides is of great importance for investigating their mechanism of action as well as for developing diagnostic and immunotherapeutic applications. This study proposes a novel stacking ensemble model (termed StackIL6) for accurately identifying IL-6 inducing peptides. More specifically, StackIL6 was constructed from twelve different feature descriptors derived from three major groups of features (composition-based features, composition-transition-distribution-based features and physicochemical properties-based features) and five popular machine learning algorithms (extremely randomized trees, logistic regression, multi-layer perceptron, support vector machine and random forest). To enhance the utility of baseline models, they were effectively and systematically integrated through a stacking strategy to build the final meta-based model. Extensive benchmarking experiments demonstrated that StackIL6 could achieve significantly better performance than the existing method (IL6PRED) and outperformed its constituent baseline models on both training and independent test datasets, which thereby support its excellent discrimination and generalization abilities. To facilitate easy access to the StackIL6 model, it was established as a freely available web server accessible at http://camt.pythonanywhere.com/StackIL6. It is anticipated that StackIL6 can help to facilitate rapid screening of promising IL-6 inducing peptides for the development of diagnostic and immunotherapeutic applications in the future.

作者

我是这篇论文的作者

点击您的名字以认领此论文并将其添加到您的个人资料中。

主要评分

4.7

评分不足

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

Integrative machine learning framework for the identification of cell-specific enhancers from the human genome

Shaherin Basith, Md Mehedi Hasan, Gwang Lee, Leyi Wei, Balachandran Manavalan

Summary: Enhancers are DNA fragments that enhance the transcription of related genes when bound by transcription factors. Identifying enhancers from the human genome is challenging due to their sporadic distribution and similar fractions. The proposed integrative machine learning framework, Enhancer-IF, showed excellent prediction performance in identifying cell-specific enhancers across different cell types, highlighting its superiority in enhancer identification.