4.6 Article

OPTIMAL LOW-RANK APPROXIMATIONS OF BAYESIAN LINEAR INVERSE PROBLEMS

Journal

SIAM JOURNAL ON SCIENTIFIC COMPUTING
Volume 37, Issue 6, Pages A2451-A2487

Publisher

SIAM PUBLICATIONS
DOI: 10.1137/140977308

Keywords

inverse problems; Bayesian inference; low-rank approximation; covariance approximation; Forstner-Moonen metric; posterior mean approximation; Bayes risk; optimality

Funding

  1. U.S. Department of Energy, Office of Advanced Scientific Computing (ASCR) [DE-SC0003908, DE-SC0009297]

Ask authors/readers for more resources

In the Bayesian approach to inverse problems, data are often informative, relative to the prior, only on a low-dimensional subspace of the parameter space. Significant computational savings can be achieved by using this subspace to characterize and approximate the posterior distribution of the parameters. We first investigate approximation of the posterior covariance matrix as a low-rank update of the prior covariance matrix. We prove optimality of a particular update, based on the leading eigendirections of the matrix pencil defined by the Hessian of the negative log-likelihood and the prior precision, for a broad class of loss functions. This class includes the Forstner metric for symmetric positive definite matrices, as well as the Kullback-Leibler divergence and the Hellinger distance between the associated distributions. We also propose two fast approximations of the posterior mean and prove their optimality with respect to a weighted Bayes risk under squared-error loss. These approximations are deployed in an offline-online manner, where a more costly but data-independent offline calculation is followed by fast online evaluations. As a result, these approximations are particularly useful when repeated posterior mean evaluations are required for multiple data sets. We demonstrate our theoretical results with several numerical examples, including high-dimensional X-ray tomography and an inverse heat conduction problem. In both of these examples, the intrinsic low-dimensional structure of the inference problem can be exploited while producing results that are essentially indistinguishable from solutions computed in the full space.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available