4.0 Article

High-dimensional variable selection accounting for heterogeneity in regression coefficients across multiple data sources

出版社

WILEY
DOI: 10.1002/cjs.11793

关键词

ADMM; coefficient clustering; data heterogeneity; k-means; variable selection

向作者/读者索取更多资源

In this article, we propose a new adaptive clustering penalty (ACP) method for selecting variables and clustering source-specific regression coefficients in high-dimensional linear regression models for integrative data analysis. We demonstrate that the ACP method has a strong oracle property under certain regularity conditions. We also develop an efficient parameter estimation algorithm based on the alternating direction method of multipliers (ADMM). Simulation studies are conducted to compare the performance of the proposed method to three existing methods. The proposed method is then applied to the multicentre Childhood Adenotonsillectomy Trial to identify subhomogeneity in treatment effects across different study sites.
When analyzing data combined from multiple sources (e.g., hospitals, studies), the heterogeneity across different sources must be accounted for. In this article, we consider high-dimensional linear regression models for integrative data analysis. We propose a new adaptive clustering penalty (ACP) method to simultaneously select variables and cluster source-specific regression coefficients with subhomogeneity. We show that the estimator based on the ACP method enjoys a strong oracle property under certain regularity conditions. We also develop an efficient algorithm based on the alternating direction method of multipliers (ADMM) for parameter estimation. We conduct simulation studies to compare the performance of the proposed method to three existing methods (a fused LASSO with adjacent fusion, a pairwise fused LASSO and a multidirectional shrinkage penalty method). Finally, we apply the proposed method to the multicentre Childhood Adenotonsillectomy Trial to identify subhomogeneity in the treatment effects across different study sites.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.0
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据