4.2 Article

Massive Parallelization of Serial Inference Algorithms for a Complex Generalized Linear Model

出版社

ASSOC COMPUTING MACHINERY
DOI: 10.1145/2414416.2414791

关键词

Optimization; parallel processing; big data

资金

  1. National Institutes of Health [R01 HG006139]
  2. Google
  3. Foundation for the National Institutes of Health
  4. NATIONAL HUMAN GENOME RESEARCH INSTITUTE [R01HG006139] Funding Source: NIH RePORTER
  5. NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES [R01GM086887] Funding Source: NIH RePORTER

向作者/读者索取更多资源

Following a series of high-profile drug safety disasters in recent years, many countries are redoubling their efforts to ensure the safety of licensed medical products. Large-scale observational databases such as claims databases or electronic health record systems are attracting particular attention in this regard, but present significant methodological and computational concerns. In this article we show how high-performance statistical computation, including graphics processing units, relatively inexpensive highly parallel computing devices, can enable complex methods in large databases. We focus on optimization and massive parallelization of cyclic coordinate descent approaches to fit a conditioned generalized linear model involving tens of millions of observations and thousands of predictors in a Bayesian context. We find orders-of-magnitude improvement in overall run-time. Coordinate descent approaches are ubiquitous in high-dimensional statistics and the algorithms we propose open up exciting new methodological possibilities with the potential to significantly improve drug safety.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.2
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据