4.2 Article

Optimal finite sample post-selection confidence distributions in generalized linear models

期刊

出版社

ELSEVIER
DOI: 10.1016/j.jspi.2022.06.001

关键词

Confidence distribution; Confidence interval; Exponential family; Model selection; Post-selection inference; Sufficiency

向作者/读者索取更多资源

Uniformly most powerful confidence distributions are derived for parameters in selected models of the exponential family. Conditioning on the selection event and sufficient statistics of nuisance parameters ensures valid post-selection inference. Optimal confidence intervals are obtained directly from the confidence distribution without the need for inversion of pivotal quantities. Simulations demonstrate the effectiveness of the method even when all models are misspecified.
Uniformly most powerful confidence distributions are obtained for parameters in selected models of the exponential family. A conditioning on the selection event as well as on the sufficient statistics of nuisance parameters guarantees valid post-selection inference. Optimal confidence intervals are obtained directly from the confidence distribution without requiring an inversion of pivotal quantities. Simulations showcase that the method works also when all models are misspecified.(c) 2022 Elsevier B.V. All rights reserved.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Statistics & Probability

Exact uniformly most powerful postselection confidence distributions

Andrea C. Garcia-Angulo, Gerda Claeskens

Summary: Conditioning on the event of selecting a model leads to the construction of uniformly optimal conditional confidence distributions, which can be used for valid postselection inference, including the construction of confidence intervals and hypothesis tests.

SCANDINAVIAN JOURNAL OF STATISTICS (2023)

Article Statistics & Probability

Linear Manifold Modeling and Graph Estimation based on Multivariate Functional Data with Different Coarseness Scales

Eugen Pircalabelu, Gerda Claeskens

Summary: We propose a high-dimensional graphical modeling approach for functional data, which addresses the issue of having more functions than available samples. By using a sparse estimator and identifying linear manifolds, we can infer the connections between functional nodes and estimate multiple related graphs simultaneously.

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS (2023)

Article Statistics & Probability

Automatic bias correction for testing in high-dimensional linear models

Jing Zhou, Gerda Claeskens

Summary: This research proposes a robust testing framework to deal with the complicated asymptotic distribution of test statistics based on regularized estimators in high dimensions. The framework is implemented using the robust approximate message passing algorithm and includes built-in bias correction. It is applicable to general convex non-differentiable loss functions and allows for inference on conditional quantiles.

STATISTICA NEERLANDICA (2023)

Article Statistics & Probability

Modeling the Occurrence of Events Subject to a Reporting Delay via an EM Algorithm

Roel Verbelen, Katrien Antonio, Gerda Claeskens, Jonas Crevecoeur

Summary: This study examines the relationship between the delay in occurrence and reporting of events and proposes a flexible regression framework for the joint estimation of event occurrence and reporting. By relating this framework to an incomplete data problem, estimation is carried out using an expectation-maximization algorithm. The proposed method is elegant, easy to understand and implement, and provides accurate predictions.

STATISTICAL SCIENCE (2022)

Article Statistics & Probability

Robust and efficient estimation of nonparametric generalized linear models

Ioannis Kalogridis, Gerda Claeskens, Stefan Van Aelst

Summary: Generalized linear models are flexible tools for analyzing diverse datasets, but the classical formulation has limitations such as the requirement for correctly specified parametric component and absence of atypical observations. To overcome these issues, we propose and study a family of nonparametric spline estimators that are obtained through the minimization of a penalized density power divergence. These estimators are easily implementable, offer high protection against outliers, and can be tuned for high efficiency in clean data scenarios. We demonstrate that these estimators converge rapidly under weak assumptions and showcase their competitive performance through simulation and real-data examples.
Article Pharmacology & Pharmacy

A report on SARS-CoV-2 first wave in Ecuador: drug consumption dynamics

Andrea Orellana-Manzano, Fernanda B. Cordeiro, Andrea Garcia-Angulo, Elizabeth Centeno, Maria Jose Vizcaino-Tumbaco, Sebastian Poveda, Ricardo Murillo, Derly Andrade-Molina, Mariuxi Miraba, Saurabh Mehta, Washington Cardenas

Summary: This retrospective study investigated 10,175 individuals who underwent RT-PCR tests in Ecuador from July to November 2020. The results showed no association between RT-PCR results and sex, age, or comorbidities in positive COVID-19 cases. Cotopaxi and Napo had the highest rates of positive cases, while Manabi, Santa Elena, and Guayas regions had lower rates. Drug consumption was higher in negative COVID-19 cases compared to positive cases, and acetaminophen was the most commonly consumed medication in both groups. Acetaminophen and antihistamines had higher odds of consumption in positive cases. Symptoms like fever and cough were more related to positive RT-PCR results.

FRONTIERS IN PHARMACOLOGY (2023)

Meeting Abstract Pharmacology & Pharmacy

Pharmacovigilance of the COVID-19 Vaccine in Ecuador During the First and Second Doses of Vaccination Campaigns

Andrea Orellana Manzano, Fernanda B. Cordeiro, Andrea Garcia-Angulo, Diana Carvajal-Aldaz, Elizabeth Centeno, Maria J. Vizcaino, Sebastian Poveda, Ricardo Murillo, Derly Andrade-Molina, Saurabh Mehta, Washinton Cardenas

JOURNAL OF PHARMACOLOGY AND EXPERIMENTAL THERAPEUTICS (2023)

Article Economics

A hierarchical mixture cure model with unobserved heterogeneity for credit risk

Lore Dirick, Gerda Claeskens, Andrey Vasnev, Bart Baesens

Summary: This study focuses on the specific nature of credit loan data and proposes the use of mixture cure models for survival analysis. The constructed models incorporate competing risks, such as early repayment and default, as well as maturity and unobserved heterogeneity within risk groups. A hierarchical expectation-maximization algorithm is developed to fit the models and estimate standard errors. Simulations and data analysis demonstrate the applicability and benefits of these models, particularly in improving event time estimation.

ECONOMETRICS AND STATISTICS (2022)

Article Statistics & Probability

Nonparametric C- and D-vine-based quantile regression

Marija Tepegjozova, Jing Zhou, Gerda Claeskens, Claudia Czado

Summary: Quantile regression is an increasingly important field in statistical modeling. This paper introduces a nonparametric quantile regression approach based on copulas, which allows separate modeling of marginal distributions and dependence structure, resulting in a flexible and accurate model.

DEPENDENCE MODELING (2022)

Article Statistics & Probability

Nonlinear mixed effects modeling and warping for functional data using B-splines

Gerda Claeskens, Emilie Devijver, Irene Gijbels

Summary: The paper introduces the use of a nonlinear mixed effects model in functional data to find the global mean pattern and capture individual curve differences in phase and amplitude. The study provides sufficient and necessary conditions for model identifiability, and validates the effectiveness of the proposed method through theoretical and simulation research.

ELECTRONIC JOURNAL OF STATISTICS (2021)

Article Statistics & Probability

Detangling robustness in high dimensions: Composite versus model-averaged estimation

Jing Zhou, Gerda Claeskens, Jelena Bradic

ELECTRONIC JOURNAL OF STATISTICS (2020)

Article Automation & Control Systems

Community-Based Group Graphical Lasso

Eugen Pircalabelu, Gerda Claeskens

JOURNAL OF MACHINE LEARNING RESEARCH (2020)

Article Statistics & Probability

Estimation for the Cox model with biased sampling data via risk set sampling

Omidali Aghababaei Jazi

Summary: In this paper, a pseudo-partial likelihood estimation method is proposed to estimate parameters in the Cox proportional hazards model with right-censored and biased sampling data by adjusting sample risk sets. The asymptotic properties of the resulting estimator are studied, and a simulation study is conducted to illustrate the finite sample performance. The proposed method is also applied to analyze a set of HIV/AIDS data.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Robust penalized empirical likelihood in high dimensional longitudinal data analysis

Liya Fu, Shuwen Hu, Jiaqi Li

Summary: Empirical likelihood (EL) is an effective nonparametric method that combines estimating equations flexibly and adaptively. A penalized EL method based on robust estimating functions is proposed for variable selection in a high-dimensional model, allowing the dimensions to grow exponentially with the sample size. The proposed method improves robustness and effectiveness in the presence of outliers or heavy-tailed data. Extensive simulation studies and a real data example demonstrate the enhanced variable selection accuracy when dealing with heavy-tailed data or outliers.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Subgroup analysis for the functional linear model

Yifan Sun, Ziyi Liu, Wu Wang

Summary: This paper extends the classical functional linear regression model to allow for heterogeneous coefficient functions among different subgroups of subjects. A penalization-based approach is proposed to simultaneously determine the number and structure of subgroups and coefficient functions within each subgroup. The paper provides an effective computational algorithm and establishes the oracle properties and estimation consistency of the model. Extensive numerical simulations demonstrate its superiority compared to competing methods, and an analysis of an air quality dataset leads to interesting findings and improved predictions.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

A pair of novel priors for improving and extending the conditional MLE

Takemi Yanagimoto, Yoichi Miyata

Summary: A Bayesian estimator is proposed to improve the conditional maximum likelihood estimation by introducing a pair of priors. The conditional maximum likelihood estimation is explained using the posterior mode under a prior, and a promising estimator is defined using the posterior mean under a corresponding prior. The advantages of this approach include two different optimality properties of the induced estimator, the ease of various extensions, and the possible treatments for a finite sample size. The existing approaches are discussed and critiqued.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Jackknife empirical likelihood confidence intervals for the categorical Gini correlation

Sameera Hewage, Yongli Sang

Summary: This paper introduces a new method for measuring dependence, the categorical Gini correlation rho(g), and proposes a Jackknife empirical likelihood approach for constructing confidence intervals. Simulation studies and real data applications demonstrate competitive performance of the proposed method in terms of coverage accuracy and interval length.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

A multidimensional objective prior distribution from a scoring rule

Isadora Antoniano-Villalobos, Cristiano Villa, Stephen G. Walker

Summary: Constructing objective priors for multidimensional parameter spaces is challenging, and a common approach assumes independence and uses standard objective methods to obtain marginal distributions. In this paper, a novel objective prior is proposed by extending the objective method for one-dimensional case, allowing for a dependence structure in multidimensional parameter spaces.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Construction of optimal supersaturated designs by the expansive replacement method

Hui Li, Liuqing Yang, Kashinath Chatterjee, Min-Qian Liu

Summary: Supersaturated design (SSD) plays a crucial role in factor screening, and E(f(NOD)) criterion is one of the most widely used criteria for evaluating multi-level and mixed-level SSDs. This paper provides methods to construct multi-level E(f(NOD)) optimal SSDs with general run sizes, which can also be extended to mixed-level SSDs. The main idea of these methods is to combine two processed generalized Hadamard matrices with the expansive replacement method. These proposed methods are easy to implement, and the non-orthogonality between any two columns of the resulting SSDs is well controlled by that of the source designs.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

A comparison of likelihood-based methods for size-biased sampling

Victoria L. Leaver, Robert G. Clark, Pavel N. Krivitsky, Carole L. Birrell

Summary: This article compares three likelihood approaches to estimation under informative sampling and examines their efficiency and asymptotic variance. The study shows that sample likelihood estimation approaches the efficiency of full maximum likelihood estimation when the sample size tends to infinity and the sampling fraction tends to zero. However, when the sample size tends to infinity and the sampling fraction is not negligible, maximum likelihood estimation is more efficient due to considering the possibility of duplicate samples. Pseudo-likelihood estimation can perform poorly in certain cases. For a special case where the superpopulation is exponential and the selection is probability proportional to size, the anticipated variance of pseudo-likelihood estimation is infinite.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Maximum likelihood estimation of the log-concave component in a semi-parametric mixture with a standard normal density

Fadoua Balabdaoui, Harald Besdziek

Summary: The two-component mixture model with known background density, unknown signal density, and unknown mixing proportion has been studied in this paper. The log-concave MLE of the signal density is computed using the estimator of Patra & Sen (2016), and its consistency and convergence are shown. The performance of this method is evaluated through a simulation study.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Time changes and stationarity issues for extended scalar autoregressive models

V. Girardin, R. Senoussi

Summary: This paper investigates different issues related to stationarity reduction in autoregressive models, including both continuous and discrete time cases. Necessary and sufficient conditions for autoregressive models to be weakly stationary are explored, with explicit formulas for the time changes. Furthermore, the issue of stationarity reduction for discrete sequences sampled from continuous time autoregressive processes is also considered.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Regression models for circular data based on nonnegative trigonometric sums

Juan Jose Fernandez-Duran, Maria Mercedes Gregorio-Dominguez

Summary: This paper presents the application of nonnegative trigonometric sums (NNTS) models in circular data analysis. Regression models for circular-dependent variables are constructed by fitting great circles on the parameter hypersphere, enabling the identification of different regions along the circle. The transformation of the original circular variable into a linear variable allows for the application of common linear regression methods in circular data analysis.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

Robust inference for subgroup analysis with general transformation models

Miao Han, Yuanyuan Lin, Wenxin Liu, Zhanfeng Wang

Summary: The article proposes a method based on maximum rank correlation and concave fusion to automatically determine the number of subgroups, identify subgroup structure, and estimate subgroup-specific covariate effects. The method can be used without prior grouping information and is applicable to handling censored data.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)

Article Statistics & Probability

A framework of zero-inflated bayesian negative binomial regression models for spatiotemporal data

Qing He, Hsin-Hsiung Huang

Summary: This article introduces a method for spatiotemporal data analysis with massive zeros, which is widely used in epidemiology and public health. The method fits zero-inflated negative binomial models using a Bayesian framework and employs latent variables from Polya-Gamma distributions to improve computational efficiency.

JOURNAL OF STATISTICAL PLANNING AND INFERENCE (2024)