4.8 Article

Benchmarking materials property prediction methods: the Matbench test set and Automatminer reference algorithm

期刊

NPJ COMPUTATIONAL MATERIALS
卷 6, 期 1, 页码 -

出版社

NATURE PORTFOLIO
DOI: 10.1038/s41524-020-00406-3

关键词

-

资金

  1. United States Department of Energy, Office of Basic Energy Sciences, Early Career Research Program
  2. DOE [DE-AC02-05CH11231]
  3. Office of Science, Office of Basic Energy Sciences, of the U.S. Department of Energy [DE-AC02-05CH11231]
  4. National Energy Research Scientific Computing Center (NERSC), a U.S. Department of Energy Office of Science User Facility [DE-AC02-05CH11231]

向作者/读者索取更多资源

We present a benchmark test suite and an automated machine learning procedure for evaluating supervised machine learning (ML) models for predicting properties of inorganic bulk materials. The test suite, Matbench, is a set of 13 ML tasks that range in size from 312 to 132k samples and contain data from 10 density functional theory-derived and experimental sources. Tasks include predicting optical, thermal, electronic, thermodynamic, tensile, and elastic properties given a material's composition and/or crystal structure. The reference algorithm, Automatminer, is a highly-extensible, fully automated ML pipeline for predicting materials properties from materials primitives (such as composition and crystal structure) without user intervention or hyperparameter tuning. We test Automatminer on the Matbench test suite and compare its predictive power with state-of-the-art crystal graph neural networks and a traditional descriptor-based Random Forest model. We find Automatminer achieves the best performance on 8 of 13 tasks in the benchmark. We also show our test suite is capable of exposing predictive advantages of each algorithm-namely, that crystal graph methods appear to outperform traditional machine learning methods given similar to 10(4) or greater data points. We encourage evaluating materials ML algorithms on the Matbench benchmark and comparing them against the latest version of Automatminer.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.8
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Chemistry, Inorganic & Nuclear

Benchmarking Coordination Number Prediction Algorithms on Inorganic Crystal Structures

Hillary Pan, Alex M. Ganose, Matthew Horton, Muratahan Aykol, Kristin A. Persson, Nils E. R. Zimmermann, Anubhav Jain

Summary: This study introduces the MaterialsCoord benchmark suite and CrystalNN near-neighbor algorithm, experimentally comparing CrystalNN with other algorithms and finding its performance to be similar to well-established ones. Additionally, computational demand and sensitivity to thermal motion simulations were evaluated for each algorithm.

INORGANIC CHEMISTRY (2021)

Correction Chemistry, Inorganic & Nuclear

Benchmarking Coordination Number Prediction Algorithms on Inorganic Crystal Structures (vol 60, pg 1590, 2021)

Hillary Pan, Alex M. Ganose, Matthew Horton, Muratahan Aykol, Kristin Persson, Nils E. R. Zimmermann, Anubhav Jain

INORGANIC CHEMISTRY (2021)

Article Multidisciplinary Sciences

Efficient calculation of carrier scattering rates from first principles

Alex M. Ganose, Junsoo Park, Alireza Faghaninia, Rachel Woods-Robinson, Kristin A. Persson, Anubhav Jain

Summary: The authors developed a computationally efficient method for calculating carrier scattering rates of semiconductors, which shows similar accuracy to state-of-the-art methods but at a much lower computational cost. This approach enables high-throughput computational workflows for accurate screening of carrier mobilities, lifetimes, and thermoelectric power.

NATURE COMMUNICATIONS (2021)

Article Materials Science, Multidisciplinary

Compromise between band structure and phonon scattering in efficient n-Mg3Sb2-xBix thermoelectrics

Xuemin Shi, Xinyue Zhang, Alexander Ganose, Junsoo Park, Cheng Sun, Zhiwei Chen, Siqi Lin, Wen Li, Anubhav Jain, Yanzhong Pei

Summary: The n-type Mg3Sb2-based materials have shown promising potential for thermoelectric applications due to their high band degeneracy, high carrier mobility, low lattice thermal conductivity, and environmental advantages. The addition of Mg3Bi2 alloy significantly improves their performance, but the optimal alloying concentrations vary with temperature due to a compromise between band structure and phonon scattering effects.

MATERIALS TODAY PHYSICS (2021)

Article Multidisciplinary Sciences

Thermal fluids with high specific heat capacity through reversible Diels-Alder reactions

Drew Lilley, Peiyuan Yu, Jason Ma, Anubhav Jain, Ravi Prasher

Summary: Thermal fluids play a crucial role in energy technologies, but current thermal fluids still have lower heat capacity than water and higher viscosity. The proposed thermochemical fluid has significantly higher heat capacity over a wide temperature range, lower viscosity than water, and a model is developed for designing future thermal fluids with even higher heat capacity.

ISCIENCE (2022)

Article Multidisciplinary Sciences

The defect challenge of wide-bandgap semiconductors for photovoltaics and beyond

Alex M. Ganose, David O. Scanlon, Aron Walsh, Robert L. Z. Hoye

Summary: The optoelectronic performance of wide-bandgap semiconductors is often inferior to that of their defect-tolerant small-bandgap counterparts. The authors highlight three main challenges that need to be addressed in order to mitigate the impact of defects in wide-bandgap semiconductors.

NATURE COMMUNICATIONS (2022)

Article Chemistry, Physical

Band versus Polaron: Charge Transport in Antimony Chalcogenides

Xinwei Wang, Alex M. Ganose, Sean R. Kavanagh, Aron Walsh

Summary: Antimony sulfide and selenide have emerged as promising absorbers for photovoltaic applications, with band-like charge-carrier transport governed by large polarons. The room-temperature carrier mobility is limited by scattering from polar phonon modes and sample defects.

ACS ENERGY LETTERS (2022)

Article Chemistry, Physical

Computational Prediction and Experimental Realization of Earth-Abundant Transparent Conducting Oxide Ga-Doped ZnSb2O6

Adam J. Jackson, Benjamin J. Parrett, Joe Willis, Alex M. Ganose, W. W. Winnie Leung, Yuhan Liu, Benjamin A. D. Williamson, Timur K. Kim, Moritz Hoesch, Larissa S. I. Veiga, Raman Kalra, Jens Neu, Charles A. Schmuttenmaer, Tien-Lin Lee, Anna Regoutz, Tung-Chun Lee, Tim D. Veal, Robert G. Palgrave, Robin Perry, David O. Scanlon

Summary: This study demonstrates the ideal transparent conducting oxide ZnSb2O6 and identifies gallium as the optimal dopant for achieving high conductivity and transparency. Experimental validation of computational predictions shows that Ga-doped ZnSb2O6 exhibits behavior consistent with a degenerate transparent conducting oxide.

ACS ENERGY LETTERS (2022)

Article Chemistry, Inorganic & Nuclear

Quantifying Local Atomic Distortions in UO2 Grain Boundaries: Correlation with Energetic and Electronic Properties

Guikai Zheng, Qi Wang, Jinfan Chen, Ruizhi Qiu, Min Zhu

Summary: This study uses density functional theory to investigate the local cluster structures, energetic stabilities, and electronic properties of UO2 grain boundaries (GBs). The presence of new medium-range U-O bonds in the GBs is highlighted, which is often overlooked in conventional coordination analysis. The Smooth Overlap of Atomic Positions (SOAP) method is used to quantitatively describe the distorted clusters, and a dissimilarity index is defined to measure the differences between the distorted and perfect structures. The results show that the medium-range SOAP dissimilarity is well correlated with the GB excess energy, and high-energy GBs have shortened band gaps primarily contributed by distorted U clusters.

INORGANIC CHEMISTRY (2023)

Article Chemistry, Physical

Relativistic electronic structure and photovoltaic performance of K2CsSb

Ruiqi Wu, Alex M. Ganose

Summary: The study discovers that K2CsSb exhibits ideal properties for use in photovoltaic applications, and its maximum theoretical efficiency can compete with other state-of-the-art photovoltaics.

JOURNAL OF MATERIALS CHEMISTRY A (2023)

Article Materials Science, Multidisciplinary

Designing transparent conductors using forbidden optical transitions

Rachel Woods-Robinson, Yihuang Xiong, Jimmy-Xuan Shen, Nicholas Winner, Matthew K. Horton, Mark Asta, Alex M. Ganose, Geoffroy Hautier, Kristin A. Persson

Summary: Many semiconductors have weak or forbidden transitions at their band edges, creating a widened transparency region. The presence of forbidden transitions has been neglected in the study of new p-type transparent conductors, unlike in n-type transparent conductors. Through computation and analysis of absorption spectra, it is found that compounds with highly localized band-edge states are more likely to exhibit forbidden transitions. This study suggests potential p-type and n-type transparent conductor candidates with forbidden transitions.

MATTER (2023)

Article Chemistry, Physical

Band gap opening from displacive instabilities in layered covalent-organic frameworks

Ju Huang, Matthias J. Golomb, Sean R. Kavanagh, Kasper Tolborg, Alex M. Ganose, Aron Walsh

Summary: Covalent organic frameworks (COFs) exhibit a high degree of flexibility in both chemical and structural aspects, with a diverse range of stacking sequences accessible. Computational study reveals a preference for slipped structures with horizontal offsets in two prototypical COFs, leading to significant changes in the underlying electronic structure and a band gap opening of 0.8-1.4 eV. Implications for screening and selecting COFs for energy applications are discussed.

JOURNAL OF MATERIALS CHEMISTRY A (2022)

Article Computer Science, Artificial Intelligence

Quantifying the advantage of domain-specific pre-training on named entity recognition tasks in materials science

Amalie Trewartha, Nicholas Walker, Haoyan Huo, Sanghoon Lee, Kevin Cruse, John Dagdelen, Alexander Dunn, Kristin A. Persson, Gerbrand Ceder, Anubhav Jain

Summary: Efficiently connecting new materials discoveries to established literature can be achieved by using named entity recognition (NER) to extract structured summary-level data from unstructured materials science text. In this study, the performance of four NER models on materials science datasets is compared, and it is found that domain-specific pre-training provides measurable advantages, with the bidirectional long short-term memory (BiLSTM) model consistently outperforming BERT.

PATTERNS (2022)

Article Chemistry, Physical

Lone pair driven anisotropy in antimony chalcogenide semiconductors

Xinwei Wang, Zhenzhu Li, Sean R. Kavanagh, Alex M. Ganose, Aron Walsh

Summary: Antimony sulfide and selenide have emerged as promising alternatives for thin-film photovoltaic compounds. The anisotropic crystal structures of these materials, composed of quasi-one-dimensional ribbons, have been found to exhibit inter-ribbon interactions beyond the van der Waals regime. The stereochemical activity of the lone pair of Sb 5s is shown to be the origin of the anisotropic structures. The impacts of structural anisotropy on the electronic, dielectric, and optical properties, as well as charge carrier transport, are examined, providing guidelines for optimizing the performance of Sb2X3-based photovoltaics.

PHYSICAL CHEMISTRY CHEMICAL PHYSICS (2022)

暂无数据