4.6 Article

The Limitations of Model-Based Experimental Design and Parameter Estimation in Sloppy Systems

Journal

PLOS COMPUTATIONAL BIOLOGY
Volume 12, Issue 12, Pages -

Publisher

PUBLIC LIBRARY SCIENCE
DOI: 10.1371/journal.pcbi.1005227

Keywords

-

Funding

  1. NCI NIH HHS [P30 CA016672] Funding Source: Medline

Ask authors/readers for more resources

We explore the relationship among experimental design, parameter estimation, and systematic error in sloppy models. We show that the approximate nature of mathematical models poses challenges for experimental design in sloppy models. In many models of complex biological processes it is unknown what are the relevant physical mechanisms that must be included to explain system behaviors. As a consequence, models are often overly complex, with many practically unidentifiable parameters. Furthermore, which mechanisms are relevant/irrelevant vary among experiments. By selecting complementary experiments, experimental design may inadvertently make details that were ommitted from the model become relevant. When this occurs, the model will have a large systematic error and fail to give a good fit to the data. We use a simple hyper-model of model error to quantify a model's discrepancy and apply it to two models of complex biological processes (EGFR signaling and DNA repair) with optimally selected experiments. We find that although parameters may be accurately estimated, the discrepancy in the model renders it less predictive than it was in the sloppy regime where systematic error is small. We introduce the concept of a sloppy system-a sequence of models of increasing complexity that become sloppy in the limit of microscopic accuracy. We explore the limits of accurate parameter estimation in sloppy systems and argue that identifying underlying mechanisms controlling system behavior is better approached by considering a hierarchy of models of varying detail rather than focusing on parameter estimation in a single model.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Biochemical Research Methods

Mechanisms of In Vivo Ribosome Maintenance Change in Response to Nutrient Signals

Andrew D. Mathis, Bradley C. Naylor, Richard H. Carson, Eric Evans, Justin Harwell, Jared Knecht, Eric Hexem, Fredrick F. Peelor, Benjamin F. Miller, Karyn L. Hamilton, Mark K. Transtrum, Benjamin T. Bikman, John C. Price

MOLECULAR & CELLULAR PROTEOMICS (2017)

Review Engineering, Chemical

The Spectrum of Mechanism-Oriented Models and Methods for Explanations of Biological Phenomena

C. Anthony Hunt, Ahmet Erdemir, William W. Lytton, Feilim Mac Gabhann, Edward A. Sander, Mark K. Transtrum, Lealem Mulugeta

PROCESSES (2018)

Article Thermodynamics

Effect of extreme temperatures on soil: A calorimetric approach

Lee D. Hansen, Nieves Barros, Mark K. Transtrum, Jose A. Rodriguez-Anon, Jorge Proupin, Veronica Pineiro, Ander Arias-Gonzalez, Nahia Gartzia

THERMOCHIMICA ACTA (2018)

Article Automation & Control Systems

Model Boundary Approximation Method as a Unifying Framework for Balanced Truncation and Singular Perturbation Approximation

Philip E. Pare, David Grimsman, Alma T. Wilson, Mark K. Transtrum, Sean Warnick

IEEE TRANSACTIONS ON AUTOMATIC CONTROL (2019)

Article Engineering, Electrical & Electronic

State Estimation Model Reduction Through the Manifold Boundary Approximation Method

Vanja G. Svenda, Mark K. Transtrum, Benjamin L. Francis, Andrija T. Saric, Aleksandar M. Stankovic

Summary: This paper presents a method for estimating system state by analyzing system observability. The method utilizes information geometry to detect unidentifiable system parameters and states, and simplifies the model by removing reference to unidentifiable state variables. The effectiveness of the method is tested through co-simulation of the physical and cyber system layers.

IEEE TRANSACTIONS ON POWER SYSTEMS (2022)

Article Chemistry, Physical

Bayesian, frequentist, and information geometric approaches to parametric uncertainty quantification of classical empirical interatomic potentials

Yonatan Kurniawan, Cody L. Petrie, Kinamo J. J. Williams, Mark K. Transtrum, Ellad B. Tadmor, Ryan S. Elliott, Daniel S. Karls, Mingjian Wen

Summary: This paper investigates the quantification of parametric uncertainty in classical empirical interatomic potentials using Bayesian and frequentist methods. It reveals that these potentials are typically insensitive and parameters are unidentifiable. Information geometry is used to explain the underlying cause and suggest new parameterizations and simplified models.

JOURNAL OF CHEMICAL PHYSICS (2022)

Review Physics, Multidisciplinary

Information geometry for multiparameter models: new perspectives on the origin of simplicity

Katherine N. Quinn, Michael C. Abbott, Mark K. Transtrum, Benjamin B. Machta, James P. Sethna

Summary: Complex models in various fields often have parameter ambiguity, where the parameters of the model are not well determined by the predictions for collective behavior. This review uses information geometry to explore the concept of sloppiness and its connection to emergent theories. The review discusses the structure of the model manifold and how it can explain why only a few parameter combinations matter for behavior. It also introduces methods for finding simpler models on nearby boundaries of the model manifold and discusses Bayesian priors that favor simpler models.

REPORTS ON PROGRESS IN PHYSICS (2023)

Article Computer Science, Information Systems

Integration of Physics- and Data-Driven Power System Models in Transient Analysis After Major Disturbances

Aleksandar A. Saric, Mark K. Transtrum, Andrija T. Saric, Aleksandar M. Stankovic

Summary: This article explores the analysis of transient phenomena in large-scale power systems subjected to major disturbances from the aspect of interleaving, coordinating, and refining physics- and data-driven models. The study proposes a framework that enables coordinated and seamlessly integrated use of the two types of models in engineered systems.

IEEE SYSTEMS JOURNAL (2023)

Article Chemistry, Multidisciplinary

K-Means Clustering of 51 Geospatial Layers Identified for Use in Continental-Scale Modeling of Outdoor Acoustic Environments

Katrina Pedersen, Ryan R. Jensen, Lucas K. Hall, Mitchell C. Cutler, Mark K. Transtrum, Kent L. Gee, Shane V. Lympany

Summary: Applying machine learning methods to geographic data helps in understanding spatial patterns and interpreting environments. This study used k-means clustering to analyze 51 geospatial layers and identified 8 clusters with distinct characteristics. The results can guide data collection for modeling outdoor acoustic environments.

APPLIED SCIENCES-BASEL (2023)

Proceedings Paper Computer Science, Interdisciplinary Applications

Extending OpenKIM with an Uncertainty Quantification Toolkit for Molecular Modeling

Yonatan Kurniawan, Cody L. Petrie, Mark K. Transtrum, Ellad B. Tadmor, Ryan S. Elliott, Daniel S. Karls, Mingjian Wen

Summary: Atomistic simulations are important in materials modeling, and the choice of interatomic potentials (IPs) greatly affects the accuracy of the predictions. Uncertainty quantification (UQ) is a new tool for assessing the reliability of these simulations. The OpenKIM project aims to standardize the study of IPs and enable transparent research, and the KLIFF Python package provides tools for fitting IP parameters. This paper introduces a UQ extension to KLIFF, focusing on parameter variations and inadequacy of the IP's functional form.

2022 IEEE 18TH INTERNATIONAL CONFERENCE ON E-SCIENCE (ESCIENCE 2022) (2022)

Article Engineering, Electrical & Electronic

Data-Driven Classification, Reduction, Parameter Identification and State Extension in Hybrid Power Systems

Andrija T. Saric, Mark K. Transtrum, Aleksandar M. Stankovic

Summary: This paper presents a manifold learning-based algorithm for big data classification and reduction, as well as parameter identification in real-time operation of a power system. The algorithm examines both black-box and gray-box settings for SCADA- and PMU-based measurements, and uses improved data-informed metric construction for partition trees in data classification. Demonstrations are made on a measurement tensor example of calculated transient dynamics between two SCADA refreshing scans.

IEEE TRANSACTIONS ON POWER SYSTEMS (2021)

Article Engineering, Electrical & Electronic

Symbolic Regression for Data-Driven Dynamic Model Refinement in Power Systems

Andrija T. Saric, Aleksandar A. Saric, Mark K. Transtrum, Aleksandar M. Stankovic

Summary: This paper presents a data-driven symbolic regression identification method designed for power systems, which extends the SINDy modeling procedure to include exogenous signals and nonlinear trigonometric terms. The resulting framework is shown to require minimal data, be computationally efficient, and robust to noise, making it a feasible option for online identification in response to rapid system changes. The proposed method is illustrated on a real-world benchmark example, demonstrating its effectiveness in reducing the differential-algebraic equations-based SG dynamic models.

IEEE TRANSACTIONS ON POWER SYSTEMS (2021)

Article Materials Science, Multidisciplinary

Effect of the density of states at the Fermi level on defect free energies and superconductivity: A case study of Nb3Sn

Nathan S. Sitaraman, Michelle M. Kelley, Ryan D. Porter, Matthias U. Liepe, Tomas A. Arias, Jared Carlson, Alden R. Pack, Mark K. Transtrum, Ravishankar Sundararaman

Summary: The electronic free energy has a profound effect in systems with a high-temperature threshold for kinetics and a high Fermi-level density of states. Antisite defects disrupt the high Fermi-level density of states and locally reduce electronic free energy, affecting superconductivity. The study on Nb3Sn reveals the key role of electronic free energy in determining the behavior of antisite defects, their interactions, and their impact on superconductivity.

PHYSICAL REVIEW B (2021)

Article Materials Science, Multidisciplinary

Analysis of magnetic vortex dissipation in Sn-segregated boundaries in Nb3Sn superconducting RF cavities

Jared Carlson, Alden Pack, Mark K. Transtrum, Jaeyel Lee, David N. Seidman, Danilo B. Liarte, Nathan S. Sitaraman, Alen Senanian, Michelle M. Kelley, James P. Sethna, Tomas Arias, Sam Posen

Summary: Our study investigates the mechanisms of vortex nucleation in Nb3Sn superconducting cavities, revealing Sn segregation at grain boundaries which may affect the local superconducting properties. Using ab initio calculations and time-dependent Ginzburg-Landau theory, simulations show that grain boundaries can act as both nucleation sites and pinning sites for vortices. We estimate the superconducting losses due to vortices filling grain boundaries and compare with experimental observations of cavity heating for performance evaluation.

PHYSICAL REVIEW B (2021)

Article Materials Science, Multidisciplinary

Vortex nucleation in superconductors within time-dependent Ginzburg-Landau theory in two and three dimensions: Role of surface defects and material inhomogeneities

Alden R. Pack, Jared Carlson, Spencer Wadsworth, Mark K. Transtrum

PHYSICAL REVIEW B (2020)

No Data Available