4.6 Article

ANALYSIS OF A SPLITTING APPROACH FOR THE PARALLEL SOLUTION OF LINEAR SYSTEMS ON GPU CARDS

期刊

SIAM JOURNAL ON SCIENTIFIC COMPUTING
卷 39, 期 3, 页码 C215-C237

出版社

SIAM PUBLICATIONS
DOI: 10.1137/15M1039523

关键词

sparse linear system solution; parallel computing; GPU computing; Krylov-subspace; method; preconditioning; work splitting; matrix reordering

资金

  1. National Science Foundation [SI2-SSE-1147337]

向作者/读者索取更多资源

We discuss an approach for solving sparse or dense banded linear systems Ax - b on a graphics processing unit (GPU) card. The matrix A is an element of R-NxN is possibly nonsymmetric and moderately large, i.e., 10, 000 <= N <= 500, 000. The split and parallelize (SaP) approach seeks to partition the matrix A into diagonal subblocks A(i,) i = 1, . . . , p , which are independently factored in parallel. The solution may choose to consider or to ignore the matrices that couple the diagonal subblocks A(i). This approach, along with the Krylov-subspace-based iterative method that it preconditions, are implemented in a solver called SaP : : GPU, which is compared in terms of efficiency with three commonly used sparse direct solvers: PARDISO, SuperLU, and MUMPS. SaP : : GPU, which runs entirely on the GPU except for several stages involved in preliminary row and column permutations, is robust and compares well in terms of efficiency with the aforementioned direct solvers. In a comparison against Intel's MKL, SaP : : GPU also fared well when used to solve dense banded systems that are close to being diagonally dominant. SaP : : GPU is publicly available and distributed as open source under a permissive BSD-3 license.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据