4.7 Article

Model selection on solid ground: Rigorous comparison of nine ways to evaluate Bayesian model evidence

期刊

WATER RESOURCES RESEARCH
卷 50, 期 12, 页码 9484-9513

出版社

AMER GEOPHYSICAL UNION
DOI: 10.1002/2014WR016062

关键词

-

资金

  1. German Research Foundation (DFG) within the International Research Training Group Integrated Hydrosystem Modelling at the University of Tubingen [IRTG 1829]
  2. German Research Foundation (DFG) within the Cluster of Excellence in Simulation Technology at the University of Stuttgart [EXC 310/1]
  3. Ministry of Science, Research and Arts of Baden-Wurttemberg [AZ Zu 33-721.3-2]
  4. Helmholtz Centre for Environmental Research, Leipzig

向作者/读者索取更多资源

Bayesian model selection or averaging objectively ranks a number of plausible, competing conceptual models based on Bayes' theorem. It implicitly performs an optimal trade-off between performance in fitting available data and minimum model complexity. The procedure requires determining Bayesian model evidence (BME), which is the likelihood of the observed data integrated over each model's parameter space. The computation of this integral is highly challenging because it is as high-dimensional as the number of model parameters. Three classes of techniques to compute BME are available, each with its own challenges and limitations: (1) Exact and fast analytical solutions are limited by strong assumptions. (2) Numerical evaluation quickly becomes unfeasible for expensive models. (3) Approximations known as information criteria (ICs) such as the AIC, BIC, or KIC (Akaike, Bayesian, or Kashyap information criterion, respectively) yield contradicting results with regard to model ranking. Our study features a theory-based intercomparison of these techniques. We further assess their accuracy in a simplistic synthetic example where for some scenarios an exact analytical solution exists. In more challenging scenarios, we use a brute-force Monte Carlo integration method as reference. We continue this analysis with a real-world application of hydrological model selection. This is a first-time benchmarking of the various methods for BME evaluation against true solutions. Results show that BME values from ICs are often heavily biased and that the choice of approximation method substantially influences the accuracy of model ranking. For reliable model selection, bias-free numerical methods should be preferred over ICs whenever computationally feasible.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据