期刊
INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES
卷 22, 期 23, 页码 -出版社
MDPI
DOI: 10.3390/ijms222313124
关键词
umami peptide; sequence analysis; bioinformatics; machine learning; feature representation learning
资金
- College of Arts, Media and Technology, Chiang Mai University
- National Research Foundation of Korea (NRF) - Korean government (MSIT) [2021R1A2C1014338]
- Chiang Mai University
- Mahidol University
A novel machine-learning meta-predictor UMPred-FRL was developed for improved umami peptide identification, combining six machine learning algorithms and seven feature encodings to achieve more accurate performance compared to baseline models.
Umami ingredients have been identified as important factors in food seasoning and production. Traditional experimental methods for characterizing peptides exhibiting umami sensory properties (umami peptides) are time-consuming, laborious, and costly. As a result, it is preferable to develop computational tools for the large-scale identification of available sequences in order to identify novel peptides with umami sensory properties. Although a computational tool has been developed for this purpose, its predictive performance is still insufficient. In this study, we use a feature representation learning approach to create a novel machine-learning meta-predictor called UMPred-FRL for improved umami peptide identification. We combined six well-known machine learning algorithms (extremely randomized trees, k-nearest neighbor, logistic regression, partial least squares, random forest, and support vector machine) with seven different feature encodings (amino acid composition, amphiphilic pseudo-amino acid composition, dipeptide composition, composition-transition-distribution, and pseudo-amino acid composition) to develop the final meta-predictor. Extensive experimental results demonstrated that UMPred-FRL was effective and achieved more accurate performance on the benchmark dataset compared to its baseline models, and consistently outperformed the existing method on the independent test dataset. Finally, to aid in the high-throughput identification of umami peptides, the UMPred-FRL web server was established and made freely available online. It is expected that UMPred-FRL will be a powerful tool for the cost-effective large-scale screening of candidate peptides with potential umami sensory properties.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据