☆ 4.6 Article

e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness

FRONTIERS IN CHEMISTRY (2019)

期刊

FRONTIERS IN CHEMISTRY

卷 7, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA

DOI: 10.3389/fchem.2019.00035

关键词

sweet taste; sweetener prediction; relative sweetness prediction; machine learning method; QSAR

类别

Chemistry, Multidisciplinary

资金

Natural Science Foundation of Zhejiang Province [Q7LY17B030007]
National Natural Science Foundation of China [21502144]
Wenzhou Medical University

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

摘要

Artificial sweeteners (AS) can elicit the strong sweet sensation with the low or zero calorie, and are widely used to replace the nutritive sugar in the food and beverage industry. However, the safety issue of current AS is still controversial. Thus, it is imperative to develop more safe and potent AS. Due to the costly and laborious experimental-screening of AS, in-silico sweetener/sweetness prediction could provide a good avenue to identify the potential sweetener candidates before experiment. In this work, we curate the largest dataset of 530 sweeteners and 850 non-sweeteners, and collect the second largest dataset of 352 sweeteners with the relative sweetness (RS) from the literature. In light of these experimental datasets, we adopt five machine-learning methods and conformational-independent molecular fingerprints to derive the classification and regression models for the prediction of sweetener and its RS, respectively via the consensus strategy. Our best classification model achieves the 95% confidence intervals for the accuracy (0.91 +/- 0.01), precision (0.90 +/- 0.01), specificity (0.94 +/- 0.01), sensitivity (0.86 +/- 0.01), F1-score (0.88 +/- 0.01), and NER (Non-error Rate: 0.90 +/- 0.01) on the test set, which outperforms the model (NER = 0.85) of Rojas et al. in terms of NER, and our best regression model gives the 95% confidence intervals for the R-2 (test set) and Delta R-2 [referring to vertical bar R-2(test set)-R-2(cross-validation)vertical bar] of 0.77 +/- 0.01 and 0.03 +/- 0.01, respectively, which is also better than the other works based on the conformation-independent 2D descriptors (e.g., 2D Dragon) according to R-2(test set) and Delta R-2. Our models are obtained by averaging over nineteen data-splitting schemes, and fully comply with the guidelines of Organization for Economic Cooperation and Development (OECD), which are not completely followed by the previous relevant works that are all on the basis of only one random data-splitting scheme for the cross-validation set and test set. Finally, we develop a user-friendly platform e-Sweet for the automatic prediction of sweetener and its corresponding RS. To our best knowledge, it is a first and free platform that can enable the experimental food scientists to exploit the current machine-learning methods to boost the discovery of more AS with the low or zero calorie content.

e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness

期刊

FRONTIERS IN CHEMISTRY

出版社

FRONTIERS MEDIA SA

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

e-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and Its Relative Sweetness

期刊

FRONTIERS IN CHEMISTRY

出版社

FRONTIERS MEDIA SA

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文