4.5 Article

Distances and inference for covariance operators

Journal

BIOMETRIKA
Volume 101, Issue 2, Pages 409-422

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/biomet/asu008

Keywords

Distance metric; Functional data analysis; Procrustes analysis; Shape analysis

Funding

  1. Engineering and Physical Sciences Research Council
  2. Royal Society
  3. EPSRC [EP/K021672/2, EP/K021672/1] Funding Source: UKRI
  4. Engineering and Physical Sciences Research Council [EP/K021672/2, EP/K021672/1] Funding Source: researchfish

Ask authors/readers for more resources

A framework is developed for inference concerning the covariance operator of a functional random process, where the covariance operator itself is an object of interest for statistical analysis. Distances for comparing positive-definite covariance matrices are either extended or shown to be inapplicable to functional data. In particular, an infinite-dimensional analogue of the Procrustes size-and-shape distance is developed. Convergence of finite-dimensional approximations to the infinite-dimensional distance metrics is also shown. For inference, a Frechet estimator of both the covariance operator itself and the average covariance operator is introduced. A permutation procedure to test the equality of the covariance operators between two groups is also considered. Additionally, the use of such distances for extrapolation to make predictions is explored. As an example of the proposed methodology, the use of covariance operators has been suggested in a philological study of cross-linguistic dependence as a way to incorporate quantitative phonetic information. It is shown that distances between languages derived from phonetic covariance functions can provide insight into the relationships between the Romance languages.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Engineering, Biomedical

Local and global energies for shape analysis in medical imaging

Valerio Varano, Paolo Piras, Stefano Gabriele, Luciano Teresi, Paola Nardinocchi, Ian L. Dryden, Concetta Torromeo, Michele Schiariti, Paolo E. Puddu

INTERNATIONAL JOURNAL FOR NUMERICAL METHODS IN BIOMEDICAL ENGINEERING (2020)

Editorial Material Operations Research & Management Science

Discussion of A review of data science in business and industry and a future view, by Grazia Vicario and Shirley Coleman

Piercesare Secchi

APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY (2020)

Article Statistics & Probability

Regression Modeling for Size-and-Shape Data Based on a Gaussian Model for Landmarks

Ian L. Dryden, Alfred Kume, Phillip J. Paine, Andrew T. A. Wood

Summary: This article proposes a regression model for size-and-shape response data, with the regression structure entering through the landmark means. Two approaches to parameter estimation are considered, one based on marginal likelihood and the other using the EM algorithm for model-consistent estimation. The article explains how to deal with challenging computational issues and demonstrates the usefulness of the regression modeling framework with real-data examples.

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION (2021)

Article Statistics & Probability

Smoothing splines on Riemannian manifolds, with applications to 3D shape space

Kwang-Rae Kim, Ian L. Dryden, Huiling Le, Katie E. Severn

Summary: This paper generalizes a smoothing spline fitting method to Riemannian manifold data and develops such a fitting procedure for shapes of configurations in general m-dimensional Euclidean space. The parallel transport along a geodesic on Kendall shape space is linked to the solution of a homogeneous first-order differential equation, enabling the approximation of unrolling and unwrapping procedures for numerical solutions for higher dimensional shape data. The fitting method is applied to analyze dynamic 3D peptide data.

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY (2021)

Article Geosciences, Multidisciplinary

Object oriented spatial analysis of natural concentration levels of chemical species in regional-scale aquifers

Alessandra Menafoglio, Laura Guadagnini, Alberto Guadagnini, Piercesare Secchi

Summary: This study addresses the characterization of spatially variable Natural Background Levels (NBLs) of concentrations of chemical species in a large-scale groundwater body. By modeling concentration PDFs as random points in a Bayes Hilbert space, the study enables spatial prediction and uncertainty quantification using Object Oriented Data Analysis. The use of depth measures for distributional data helps in detecting central and outlying NBL distributions and building prediction regions for NBL distribution at unsampled locations.

SPATIAL STATISTICS (2021)

Article Statistics & Probability

Non-parametric regression for networks

Katie E. Severn, Ian L. Dryden, Simon P. Preston

Summary: This study focuses on developing a suitable methodology for statistical analysis of network data using graph Laplacian matrices, with applications in areas such as communication networks and corpus linguistics. An adapted Nadaraya-Watson estimator is developed for estimating regression curves in dynamic networks, with uniform weak consistency for both Euclidean and power Euclidean metrics. The methodology is applied to the Enron email corpus to model trends in monthly networks and detect anomalous patterns, as well as in corpus linguistics to explore changes in writing style over time based on word co-occurrence networks.
Article Engineering, Biomedical

Discovery of synergistic material-topography combinations to achieve immunomodulatory osteoinductive biomaterials using a novel in vitro screening method: The ChemoTopoChip

Laurence Burroughs, Mahetab H. Amer, Matthew Vassey, Britta Koch, Grazziela P. Figueredo, Blessing Mukonoweshuro, Paulius Mikulskis, Aliaksei Vasilevich, Steven Vermeulen, Ian L. Dryden, David A. Winkler, Amir M. Ghaemmaghami, Felicity R. A. J. Rose, Jan de Boer, Morgan R. Alexander

Summary: In this study, a novel combinatorial chemistry-topography screening platform, the ChemoTopoChip, was used to identify materials suitable for bone regeneration by screening for human mesenchymal stem cell (hiMSCs) and human macrophage response. The results show that the materials selected through this platform can induce osteoinduction in hiMSCs and modulate macrophage phenotype, providing a materials-induced alternative to osteo-inductive supplements in bone-regeneration.

BIOMATERIALS (2021)

Article Social Sciences, Mathematical Methods

Social and material vulnerability in the face of seismic hazard: An analysis of the Italian case

Oleksandr Didkovskyi, Giovanni Azzone, Alessandra Menafoglio, Piercesare Secchi

Summary: The study evaluates the vulnerability of Italian municipalities exposed to seismic hazard by analyzing open data provided by ISTAT, in addition to considering factors such as demographic dynamics and the age of building stock. It offers a tentative ranking of social and material vulnerability, along with differential profiles of dominant fragilities, to aid in planning precision policies aimed at seismic risk prevention and reduction.

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY (2021)

Article Statistics & Probability

Covariance-based low-dimensional registration for function-on-function regression

Tobia Boschi, Francesca Chiaromonte, Piercesare Secchi, Bing Li

Summary: The study introduces a new low-dimensional registration procedure that utilizes functional covariance components to capture meaningful modes of dependency between two sets of curves. The procedure aligns the curves simultaneously to optimize subsequent regression analysis, implemented using continuous registration algorithm and a novel parallel algorithm in R, and compared to other common registration approaches through simulations and application to the AneuRisk data.
Article Chemistry, Physical

GLIMPS: A Machine Learning Approach to Resolution Transformation for Multiscale Modeling

Keverne A. Louison, Ian L. Dryden, Charles A. Laughton

Summary: This approach uses machine learning to transform molecular models between different levels of resolution, requiring only particle coordinates for training, and enabling bidirectional transformation.

JOURNAL OF CHEMICAL THEORY AND COMPUTATION (2021)

Article Statistics & Probability

The Bayesian Spatial Bradley-Terry model: Urban deprivation modelling in Tanzania

Rowland G. Seymour, David Sirl, Simon P. Preston, Ian L. Dryden, Madeleine J. A. Ellis, Bertrand Perrat, James Goulding

Summary: Identifying the most deprived regions in developing countries can be challenging due to logistical issues with traditional household surveys. The Bradley-Terry model, utilizing comparisons of affluence in different areas, provides a promising solution to simplify logistics and avoid biases. A novel Bayesian Spatial Bradley-Terry model has been developed to effectively decrease the number of comparisons required for inference, demonstrating practical effectiveness in a data set collected in Dar es Salaam, Tanzania.

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS (2022)

Article Statistics & Probability

MANIFOLD VALUED DATA ANALYSIS OF SAMPLES OF NETWORKS, WITH APPLICATIONS IN CORPUS LINGUISTICS

Katie E. Severn, Ian L. Dryden, Simon P. Preston

Summary: This study proposes a general framework for extrinsic statistical analysis of network samples in various applications, such as text documents, social interactions, and brain activity. By defining the graph Laplacian matrices of networks and corresponding metrics, embeddings, tangent spaces, the framework enables computing means, performing principal component analysis, regression, and hypothesis tests.

ANNALS OF APPLIED STATISTICS (2022)

Article Computer Science, Artificial Intelligence

A general bi-clustering algorithm for object data with an application to the analysis of a Lombardy railway line

Agostino Torti, Marta Galvani, Alessandra Menafoglio, Piercesare Secchi, Simone Vantini

Summary: This article presents a general and flexible bi-clustering algorithm for the analysis of Hilbert data. The algorithm, named HC2, is applied to analyze the regional railway service in the Lombardy region, aiming to identify recurrent patterns in passengers' daily access. By modeling the data as multivariate functional data and time series, this approach enables the measurement of both overcrowding and travel demand, providing useful insights for optimizing the service.

INTERNATIONAL JOURNAL OF APPROXIMATE REASONING (2022)

Article Geosciences, Multidisciplinary

A look at the spatio-temporal mortality patterns in Italy during the COVID-19 pandemic through the lens of mortality densities

Riccardo Scimone, Alessandra Menafoglio, Laura M. Sangalli, Piercesare Secchi

Summary: Using the tools and perspective of Object Oriented Spatial Statistics, we analyze mortality data in the provinces and municipalities of Italy for the year 2020 and assess the impact of the COVID-19 pandemic on the local death process. We use functional data and linear models to predict expected mortality based on previous years' data and compare predictions with actual observations. Spatial clustering of mortality densities is used to identify anomalous areas. This analysis pipeline can be applied to other granular spatio-temporal data to quantify the disruption caused by the pandemic.

SPATIAL STATISTICS (2022)

Article Environmental Sciences

Bayesian spatiotemporal modelling for disease mapping: an application to preeclampsia and gestational diabetes in Florida, United States

Ning Sun, Zoran Bursac, Ian Dryden, Roberto Lucchini, Sophie Dabo-Niang, Boubakari Ibrahimou

Summary: This study utilizes disease mapping models to estimate the spatiotemporal patterns of disease risks and identifies high-risk clusters for preeclampsia and gestational diabetes in Florida. It also shows that exposure to PM2.5 increases the risk of these diseases, although to a lesser extent compared to previous studies.

ENVIRONMENTAL SCIENCE AND POLLUTION RESEARCH (2023)

No Data Available