☆ 4.7 Article

Benchmarking machine learning methods for modeling physical properties of ionic liquids

JOURNAL OF MOLECULAR LIQUIDS (2022)

期刊

JOURNAL OF MOLECULAR LIQUIDS

卷 351, 期 -, 页码 -

出版社

ELSEVIER

DOI: 10.1016/j.molliq.2022.118616

关键词

Ionic liquids; Machine learning; Neural networks; QSPR; OCHEM

类别

Chemistry, Physical Physics, Atomic, Molecular & Chemical

资金

Israeli Ministry of Aliyah and Integration
Israel National Research Center for Electrochemical Propulsion (INREP)
Grand Technion Energy Program (GTEP), Israel

向作者/读者索取更多资源

Protocol

社区支持

Reagent

社区支持

智能总结 New
摘要

This study conducted a large-scale benchmarking analysis to explore the quantitative prediction of properties of ionic liquids using machine learning methods. The best combinations of ML methods and molecular representations were identified, with a focus on the performance of nonlinear ML methods, neural networks, and Transformers. The study demonstrated the advanced ability of Transformers in analyzing the chemical structures of ionic liquids encoded in SMILES text strings.

The great importance of the ability to quantitatively predict the properties of ionic liquids (ILs) using quantitative structure-property relationships (QSPR) models necessitates the understanding of which modern machine learning (ML) methods in combination with which types of molecular representations are preferable to use for this purpose. To address this problem, a large-scale benchmarking study of QSPR models built by combining three traditional ML methods and neural networks with seven different architectures with five types of molecular representations (in the form of either numerical molecular descriptors or SMILES text strings) to predict six important physical properties of ILs (density, electrical conductance, melting point, refractive index, surface tension, and viscosity) was carried out. The datasets include from 407 to 1204 diverse ILs composed of various organic and inorganic ions. QSPR models for predicting the properties of ILs at eight different temperatures were built using multi-task learning. The best combinations of ML methods and molecular representations were identified for each of the properties. A unified ranking system was introduced to rank and prioritize different ML methods and molecular representations. It was shown in this study that on average: (i) nonlinear ML methods perform much better than linear ones, (ii) neural networks perform better than traditional ML methods, (iii) Transformers, which are actively used in natural language processing (NLP), perform better than other types of neural networks due to the advanced ability to analyze chemical structures of ILs encoded into SMILES text strings. A special component-wise cross-validation scheme was applied to assess how much the predictive performance deteriorates for the ILs composed of cations and anions that are not present in the dataset. (C) 2022 Elsevier B.V. All rights reserved.

Benchmarking machine learning methods for modeling physical properties of ionic liquids

期刊

JOURNAL OF MOLECULAR LIQUIDS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

Benchmarking machine learning methods for modeling physical properties of ionic liquids

期刊

JOURNAL OF MOLECULAR LIQUIDS

出版社

ELSEVIER

关键词

类别

资金

向作者/读者索取更多资源

Protocol

Reagent

作者

我是这篇论文的作者

评论

主要评分

次要评分

新颖性

重要性

科学严谨性

评价这篇论文

推荐

导出引文

分享论文