4.6 Article

Accurate Thermochemistry with Small Data Sets: A Bond Additivity Correction and Transfer Learning Approach

期刊

JOURNAL OF PHYSICAL CHEMISTRY A
卷 123, 期 27, 页码 5826-5835

出版社

AMER CHEMICAL SOC
DOI: 10.1021/acs.jpca.9b04195

关键词

-

资金

  1. ExxonMobil [EM09079]
  2. National Energy Research Scientific Computing Center (NERSC), a U.S. Department of Energy Office of Science User Facility [DE-AC02-05CH11231]

向作者/读者索取更多资源

Machine learning provides promising new methods for accurate yet rapid prediction of molecular properties, including thermochemistry, which is an integral component of many computer simulations, particularly automated reaction mechanism generation. Often, very large data sets with tens of thousands of molecules are required for training the models, but most data sets of experimental or high-accuracy quantum mechanical quality are much smaller. To overcome these limitations, we calculate new high-level data sets and derive bond additivity corrections to significantly improve enthalpies of formation. We adopt a transfer learning technique to train neural network models that achieve good performance even with a relatively small set of high-accuracy data. The training data for the entropy model are carefully selected so that important conformational effects are captured. The resulting models are generally applicable thermochemistry predictors for organic compounds with oxygen and nitrogen heteroatoms that approach experimental and coupled cluster accuracy while only requiring molecular graph inputs. Due to their versatility and the ease of adding new training data, they are poised to replace conventional estimation methods for thermochemical parameters in reaction mechanism generation. Since high-accuracy data are often sparse, similar transfer learning approaches are expected to be useful for estimating many other molecular properties.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Thermodynamics

A wide range experimental and kinetic modeling study of the oxidation of 2,3-dimethyl-2-butene: Part 1

Jinhu Liang, Ruining He, Shashank S. Nagaraja, A. Abd El-Sabor Mohamed, Haitao Lu, Yousef M. Almarzooq, Xiaorui Dong, Olivier Mathieu, William H. Green, Eric L. Petersen, S. Mani Sarathy, Henry J. Curran

Summary: This study investigates the combustion characteristics of 2,3-dimethyl-2-butene (TME) and develops a detailed chemical kinetic model to describe its combustion. Measurements of ignition delay times and laminar flame speeds were conducted, and two kinetic mechanisms were constructed for simulating the experimental results.

COMBUSTION AND FLAME (2023)

Article Chemistry, Physical

Automated reaction kinetics and network exploration (Arkane): A statistical mechanics, thermodynamics, transition state theory, and master equation software

Alon Grinberg Dana, Matthew S. Johnson, Joshua W. Allen, Sandeep Sharma, Sumathy Raman, Mengjie Liu, Connie W. Gao, Colin A. Grambow, Mark J. Goldman, Duminda S. Ranasinghe, Ryan J. Gillis, A. Mark Payne, Yi-Pei Li, Xiaorui Dong, Kevin A. Spiekermann, Haoyang Wu, Enoch E. Dames, Zachary J. Buras, Nick M. Vandewiele, Nathan W. Yee, Shamel S. Merchant, Beat Buesser, Caleb A. Class, Franklin Goldsmith, Richard H. West, William H. Green

Summary: The open-source software Arkane enables computations of thermodynamic properties, reaction rate coefficients, and pressure-dependent kinetics for complex chemical systems. It uses estimates to fill in missing quantum chemistry information and solves the internal energy master equation. The software supports various electronic structure computation packages and provides high-pressure limit rate coefficients, thermodynamic properties, and energy corrections.

INTERNATIONAL JOURNAL OF CHEMICAL KINETICS (2023)

Article Chemistry, Medicinal

Characterizing Uncertainty in Machine Learning for Chemistry

Esther Heid, Charles J. McGill, Florence H. Vermeire, William H. Green

Summary: Characterizing uncertainty in machine learning models has gained interest recently for machine learning reliability, robustness, safety, and active learning. The total uncertainty is separated into contributions from data noise (aleatoric) and model shortcomings (epistemic), further dividing epistemic uncertainty into model bias and variance contributions. The influence of noise, model bias, and model variance in chemical property predictions is systematically addressed, revealing important trends in model performance associated with various factors such as noise level, data set size, model architecture, and ensemble size.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Chemistry, Multidisciplinary

Fast Water Transport in UTSA-280 via a Knock-Off Mechanism

Cheng-Hsun Hsu, Hsin-Yu Yu, Ho Jun Lee, Pei-Hao Wu, Shing-Jong Huang, Jong Suk Lee, Tsyr-Yan Yu, Yi-Pei Li, Dun-Yen Kang

Summary: This study uncovers a unique water transfer mechanism, called the knock-off mechanism, in UTSA-280. Through this mechanism, water molecules are able to displace and transport coordinated molecules, leading to high water flux and improved selectivity. Experimental and simulation results demonstrate that the knock-off mechanism enhances water diffusion and simulates pseudo-three-dimensional mass transfer in UTSA-280. Solid-state NMR and density functional theory provide further evidence and atomic-level insight into this mechanism.

ANGEWANDTE CHEMIE-INTERNATIONAL EDITION (2023)

Review Energy & Fuels

Perspective on Decarbonizing Long-Haul Trucks Using Onboard Dehydrogenation of Liquid Organic Hydrogen Carriers

Sayandeep Biswas, Kariana Moreno Sader, William H. H. Green

Summary: The alternative concept of onboard hydrogen release shows promise in decarbonizing long-haul trucking, achieving cost parity with diesel and reducing greenhouse gas emissions. However, further research and development are needed to optimize the efficiency and integration of core components.

ENERGY & FUELS (2023)

Article Engineering, Chemical

Detailed Multiphase Chemical Kinetic Model for Polymer Fouling in a Distillation Column

Hao-Wei Pang, Michael Forsuelo, Xiaorui Dong, Ryan E. Hawtof, Duminda S. Ranasinghe, William H. Green

Summary: Polymer fouling is a common yet unresolved issue in the separation process downstream of a steam cracker, and a mechanistic understanding is needed to mitigate it. This study introduces a multiphase chemical kinetic model to predict the growth rate of polymer film in an industrial debutanizer distillation column. The model incorporates thousands of chemical reactions, hundreds of species in vapor-liquid equilibria, transport between phases, and flows between trays, which significantly influence fouling rate. The model is validated using experimental measurements, clarifying the mechanistic details of the fouling process and discussing challenges in predicting polymer fouling in industrial units.

INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH (2023)

Article Chemistry, Medicinal

Predicting Critical Properties and Acentric Factors of Fluids Using Multitask Machine Learning

Sayandeep Biswas, Yunsie Chung, Josephine Ramirez, Haoyang Wu, William H. H. Green

Summary: This study developed a machine learning model that can predict critical properties and acentric factors of chemical compounds based on their SMILES representation, replacing expensive and time-consuming experiments. The model uses D-MPNN and graph attention network architectures, and incorporates additional atomic and molecular features, multitask training, and pretraining to optimize performance. It achieves state-of-the-art accuracies on both random and scaffold splits, and the dataset of critical properties and acentric factors is now publicly available along with the source code.

JOURNAL OF CHEMICAL INFORMATION AND MODELING (2023)

Article Chemistry, Physical

On the accuracy of the chemically significant eigenvalue method

Flemming Holtorf, William H. Green

Summary: This paper examines the accuracy and convergence properties of the chemically significant eigenvalues method and its related dominant subspace truncation method in the energy-grained master equation. It proposes a viable alternative method for cases where the aforementioned methods may fall short.

JOURNAL OF CHEMICAL PHYSICS (2023)

Article Chemistry, Physical

Computing Kinetic Solvent Effects and Liquid Phase Rate Constants Using Quantum Chemistry and COSMO-RS Methods

Yunsie Chung, William H. Green

Summary: This study assesses the accuracies of various quantum chemical and COSMO-RS levels of theory for the predictions of liquid phase rate constants and kinetic solvent effects. The results show that the omegaB97XD/def2-TZVP level of theory combined with the COSMO-RS method at the BP-TZVP level achieves the best performance.

JOURNAL OF PHYSICAL CHEMISTRY A (2023)

Article Engineering, Chemical

Developing machine learning models for accurate prediction of radiative efficiency of greenhouse gases

Balaganesh Muthiah, Shih-Cheng Li, Yi-Pei Li

Summary: This study predicts the radiative efficiency (RE) of greenhouse gases using machine learning models. An RE dataset was generated using a narrow band model and density functional theory, and validated against experimental data. The directed message-passing neural network (DMPNN) model outperformed other models, and combining it with quantum mechanical features slightly improved the prediction performance. This research provides critical insights for developing machine learning models that can predict the RE of molecules, facilitating the design of environmentally-friendly chemicals and materials.

JOURNAL OF THE TAIWAN INSTITUTE OF CHEMICAL ENGINEERS (2023)

Article Engineering, Chemical

Detailed Multiphase Chemical Kinetic Model for Polymer Fouling in a Distillation Column

Hao-Wei Pang, Michael Forsuelo, Xiaorui Dong, Ryan E. Hawtof, Duminda S. Ranasinghe, William H. Green

Summary: Polymer fouling is a common problem in the separation train downstream of a steam cracker, and a mechanistic understanding of this process is crucial for mitigation. In this study, a multiphase chemical kinetic model was developed to predict the polymer film growth rate in an industrial debutanizer distillation column. The model incorporates thousands of chemical reactions, vapor-liquid equilibria, and phase transport, all of which significantly affect fouling rate. The model was validated using experimental measurements, and the mechanistic details of the fouling process were elucidated, highlighting the challenges in accurately predicting polymer fouling in industrial units.

INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH (2023)

Article Chemistry, Physical

Integrating Chemical Information into Reinforcement Learning for Enhanced Molecular Geometry Optimization

Yu-Cheng Chang, Yi-Pei Li

Summary: This study introduces a reinforcement-learning-based optimizer that achieves exceptional results in molecular geometry optimization, particularly in dealing with challenging initial geometries. The optimizer's versatility and potential for enhancements are highlighted, showing its promise in the field of computational chemistry.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2023)

Article Chemistry, Physical

Experimental Compilation and Computation of Hydration Free Energies for Ionic Solutes

Jonathan W. Zheng, William H. Green

Summary: This study presents an experiment-based aqueous ionic solvation energy dataset, IonSolv-Aq, which is 2 times larger than the commonly used 2012 Minnesota Solvation Database. The results indicate that the popular implicit solvation models COSMO-RS and SMD can be corrected for systematic errors in predicting solvation free energies of singly charged ionic solutes in water. These findings emphasize the importance of larger experimental data sets for improving solvation model parametrization and evaluating performance.

JOURNAL OF PHYSICAL CHEMISTRY A (2023)

Article Green & Sustainable Science & Technology

Surface characterization of cerium oxide catalysts using deep learning with infrared spectroscopy of CO

H. -Y. Yu, B. Muthiah, S. -C. Li, W. -Y. Yu, Y. -P. Li

Summary: This study utilized deep learning techniques and infrared spectroscopy as a probe molecule to investigate the surface properties of CeO2 catalysts. The research successfully predicted the experimental results of CO adsorption on different types of CeO2.

MATERIALS TODAY SUSTAINABILITY (2023)

暂无数据