4.4 Article Proceedings Paper

Density-fitted singles and doubles coupled cluster on graphics processing units

期刊

MOLECULAR PHYSICS
卷 112, 期 5-6, 页码 844-852

出版社

TAYLOR & FRANCIS LTD
DOI: 10.1080/00268976.2013.874599

关键词

graphics processing units; singles doubles coupled cluster; density fitting

资金

  1. US National Science Foundation [ACI-1147843]
  2. National Science Foundation CRIF Award [CHE-0946869]
  3. National Science Foundation American Competitiveness in Chemistry Postdoctoral Fellowship [CHE-1137288]
  4. Center for Nanophase Materials Sciences
  5. Oak Ridge National Laboratory by the Scientific User Facilities Division, U.S. Department of Energy
  6. Office of Science of the U.S. Department of Energy [DE-AC05-00OR22725]
  7. UT-Battelle, LLC
  8. Direct For Mathematical & Physical Scien
  9. Division Of Chemistry [1137288] Funding Source: National Science Foundation
  10. Office of Advanced Cyberinfrastructure (OAC)
  11. Direct For Computer & Info Scie & Enginr [1147843] Funding Source: National Science Foundation

向作者/读者索取更多资源

We adapt an algorithm for singles and doubles coupled cluster (CCSD) that uses density fitting or Cholesky decomposition (CD) in the construction and contraction of all electron repulsion integrals (ERIs) for use on heterogeneous compute nodes consisting of a multicore central processing unit (CPU) and at least one graphics processing unit (GPU). The use of approximate three-index ERIs ameliorates two of the major difficulties in designing scientific algorithms for GPUs: (1) the extremely limited global memory on the devices and (2) the overhead associated with data motion across the bus. For the benzene trimer described by an aug-cc-pVDZ basis set, the use of a single NVIDIA Tesla C2070 (Fermi) GPU accelerates a CD-CCSD computation by a factor of 2.1, relative to the multicore CPU-only algorithm that uses six highly efficient Intel Core i7-3930K CPU cores. The use of two Fermi GPUs provides an acceleration of 2.89, which is comparable to that observed when using a single NVIDIA Kepler K20c GPU (2.73).

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.4
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据