Article
Statistics & Probability
Danijel Kivaranovic, Hannes Leeb
Summary: Research on valid inference after model selection is actively exploring the polyhedral method for constructing confidence intervals, with varying lengths depending on the model and computation complexity. Simulation results suggest that the sufficient condition for infinite length is met unless the selected model includes almost all or almost none of the available regressors. Additionally, kappa-quantiles exhibit a similar behavior for kappa close to 1 in the distribution of confidence interval lengths.
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
(2021)
Article
Computer Science, Artificial Intelligence
Luca Martino, Roberto San Millan-Castillo, Eduardo Morgado
Summary: We propose a generalized information criterion that encompasses other well-known criteria such as BIC and AIC. Our proposed SIC is even more general as it does not strictly require knowledge of a likelihood function. It extracts geometric features of the error curve and can be considered an automatic elbow detector.
EXPERT SYSTEMS WITH APPLICATIONS
(2023)
Article
Biology
Haibing Zhao
Summary: Post-selection inference on thousands of parameters has been studied in recent years. This article proposes two methods for narrowing the confidence intervals, by incorporating the selection event into the calculation and focusing on parameters with very small selection probabilities.
Article
Environmental Sciences
Gregor Miller, Annette Menzel, Donna P. Ankerst
Summary: This study assessed the associations between air pollution variables and COVID-19 mortality in Germany, taking into account potential confounders. The results showed that the associations became non-significant when other risk factors were considered in the model, highlighting the importance of adequately accounting for confounders in the analysis.
ENVIRONMENTAL SCIENCES EUROPE
(2022)
Article
Biology
A. Mccloskey
Summary: I propose a new type of confidence interval that allows correct asymptotic inference after model selection without assuming any model is correctly specified. This hybrid confidence interval combines techniques from selective inference and post-selection inference to provide a short interval across various data realizations. The results show that hybrid confidence intervals have correct asymptotic coverage over a broad class of probability distributions with no bound on scaled model parameters. Monte Carlo experiments and an empirical application on diabetes disease progression predictors demonstrate the desirable length and coverage properties of these confidence intervals in small samples.
Article
Computer Science, Artificial Intelligence
Wai Hoh Tang, Adrian Rollin
Summary: By training convolutional neural networks on synthetic data with known ground truths, we found that in ARMA time series models, this approach significantly outperforms traditional likelihood-based methods in terms of accuracy and speed, particularly in statistical inference and time series forecasting. This study demonstrates the feasibility of using artificial neural networks for statistical inference in situations where classical likelihood-based methods are difficult or costly to implement.
DECISION SUPPORT SYSTEMS
(2021)
Article
Environmental Sciences
Ping Xie, Jingqun Huo, Yan-Fang Sang, Yaqing Li, Jie Chen, Ziyi Wu, Vijay P. Singh
Summary: Quantifying the dependence characteristics in hydrological time series is crucial for understanding hydrological variability and managing water resources. This article proposes a correlation coefficient-based information criterion (CCIC) to determine the optimal model order for quantifying dependence characteristics. Experimental results verify the accuracy of CCIC and its superiority over existing criteria, and its application to annual precipitation in China further demonstrates its advantages.
WATER RESOURCES RESEARCH
(2022)
Article
Physics, Multidisciplinary
Wei Dai, Ka Wai Tsang
Summary: Linear models are widely used in econometrics to analyze relationships and estimate parameters. However, empirical studies have shown that these models often do not fit real-world data well, and their selection is a critical issue. In the era of big data, one approach is to consider a large number of covariates and use model selection. However, many model selection methods perform poorly in practice due to insufficient sample size, especially for financial data that are often correlated and have small samples. This study addresses the challenge of constructing accurate confidence intervals after model selection, and proposes a resampling approach with consistent estimators. Theoretical, simulation, and empirical results demonstrate the advantages of this method.
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS
(2023)
Article
Economics
Osman Dogan
Summary: In this paper, the modified harmonic mean method is proposed for estimating marginal likelihood functions of cross-sectional spatial autoregressive models. The method is shown to be applicable for popular cross-sectional spatial autoregressive models in a Bayesian estimation setting. A simulation study is conducted to investigate the performance of this estimator along with other popular information criteria for model selection problems. The simulation results demonstrate that the modified harmonic mean estimator performs well and can be useful for specification search exercises in spatial econometrics.
Article
Mathematics
Nitzan Cohen, Yakir Berchenko
Summary: This article proposes a new approach that enables the use of classic information criteria for model selection with missing data by normalizing the information criteria theory, which is found to be exponentially better in computational complexity than traditional imputation methods, leading to increased statistical efficiency.
Article
Statistics & Probability
R. Alraddadi, Q. Shao
Summary: The study proposes a two-step model selection procedure for ARMA models, suitable for time series contaminated with nonlinear trends. The method is able to effectively identify the true model and maintains asymptotic properties as sample size increases.
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS
(2022)
Article
Computer Science, Hardware & Architecture
Saleh Albahli, Ghulam Nabi Ahmad Hassan Yar
Summary: This study focuses on data cleaning and selection of best parameter values for software defect prediction. The Akaike information criterion (AIC) and the Bayesian information criterion (BIC) were used to select the best variables, and a simple ANN model was trained. The results show that the combination of two variables, ns and entropy, is the best for software defect prediction.
COMPUTER SYSTEMS SCIENCE AND ENGINEERING
(2022)
Article
Biochemical Research Methods
Alisa O. Tokareva, Vitaliy V. Chagovets, Alexey S. Kononikhin, Natalia L. Starodubtseva, Evgeny N. Nikolaev, Vladimir E. Frankevich
Summary: A reliable diagnostic model can be built by selecting lipid species with the most discriminative potential and developing the model based on these lipids.
JOURNAL OF MASS SPECTROMETRY
(2021)
Article
Statistics & Probability
Jesse Frey, Yimin Zhang
Summary: Melded confidence intervals are proposed to combine two one-sample confidence intervals, but they do not guarantee the nominal coverage when calculating the difference in population quantiles.
AMERICAN STATISTICIAN
(2023)
Article
Health Care Sciences & Services
Emeline Courtois, Pascale Tubert-Bitter, Ismail Ahmed
Summary: This study proposes a new signal detection methodology based on the adaptive lasso, which shows equivalent or better performances compared to other competitors in simulations and real data applications. The CISL-based adaptive weights have an advantage over others, making the adaptive lasso a solid alternative for signal detection in pharmacovigilance.
BMC MEDICAL RESEARCH METHODOLOGY
(2021)
Article
Statistics & Probability
Hannes Leeb
ANNALS OF STATISTICS
(2009)
Article
Statistics & Probability
Hannes Leeb
ANNALS OF STATISTICS
(2013)
Article
Statistics & Probability
Hannes Leeb
Article
Statistics & Probability
Nina Huber, Hannes Leeb
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS
(2013)
Article
Statistics & Probability
Adityanand Guntuboyina, Hannes Leeb
ELECTRONIC COMMUNICATIONS IN PROBABILITY
(2009)
Article
Economics
Hannes Leeb, Benedikt M. Poetscher
JOURNAL OF ECONOMETRICS
(2008)
Article
Statistics & Probability
Benedikt M. Poetscher, Hannes Leeb
JOURNAL OF MULTIVARIATE ANALYSIS
(2009)
Correction
Economics
Hannes Leeb, Benedikt M. Poetscher
ECONOMETRIC THEORY
(2008)
Article
Statistics & Probability
Paul Kabaila, Khageswor Giri, Hannes Leeb
ELECTRONIC JOURNAL OF STATISTICS
(2010)
Article
Economics
Hannes Leeb, Benedikt M. Poetscher
ECONOMETRIC THEORY
(2008)
Article
Statistics & Probability
Hannes Leeb, Benedikt M. Poetscher
ANNALS OF STATISTICS
(2006)
Article
Statistics & Probability
Paul Kabaila, Hannes Leeb
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
(2006)
Article
Economics
H Leeb, BM Pötscher
ECONOMETRIC THEORY
(2006)
Article
Economics
H Leeb, BM Pötscher
ECONOMETRIC THEORY
(2005)