期刊
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING
卷 70, 期 3, 页码 270-281出版社
ACADEMIC PRESS INC ELSEVIER SCIENCE
DOI: 10.1016/j.jpdc.2009.09.007
关键词
Parallel direct solvers; Finite Element Method; hp adaptivity; 3D borehole resistivity
资金
- Foundation for Polish Science
- Polish MNiSW [NN 519 318 635]
- University of Texas at Austin Research Consortium on Formation Evaluation
- Aramco
- Baker Atlas
- BP
- British Gas
- ConocoPhilips
- Chevron
- ENI EP
- ExxonMobil
- Halliburton Energy Services
- Hydro
- Marathon Oil Corporation
- Mexican Institute for Petroleum
- Occidental Petroleum Corporation
- Petrobras
- Schlumberger
- Shell International EP
- Statoil
- TOTAL
- Weatherford
- Spanish Ministry of Science and Innovation [MTM2008-03541, TEC2007-65214, PTQ-08-03-08467]
In this paper we present a new parallel multi-frontal direct solver, dedicated for the hp Finite Element Method (hp-FEM). The self-adaptive hp-FEM generates in a fully automatic mode, a sequence of hp-meshes delivering exponential convergence of the error with respect to the number of degrees of freedom (d.o.f.) as well as the CPU time, by performing a sequence of hp refinements starting from an arbitrary initial mesh. The solver constructs an initial elimination tree for an arbitrary initial mesh, and expands the elimination tree each time the mesh is refined. This allows us to keep track of the order of elimination for the solver. The solver also minimizes the memory usage, by de-allocating partial LU factorizations computed during the elimination stage of the solver, and recomputes them for the backward substitution stage, by utilizing only about 10% of the Computational time necessary for the original computations. The solver has been tested on 3D Direct Current (DC) borehole resistivity measurement simulations problems. We measure the execution time and memory usage of the solver over a large regular mesh with 1.5 million degrees of freedom as well as on the highly non-regular mesh, generated by the self-adaptive hp-FEM, with finite elements of various sizes and polynomial orders of approximation varying from p = 1 to p = 9. From the presented experiments it follows that the parallel solver scales well up to the maximum number of utilized processors. The limit for the solver scalability is the maximum sequential part of the algorithm: the computations of the partial LU factorizations over the longest path, coming from the root of the elimination tree down to the deepest leaf. (C) 2009 Elsevier Inc. All rights reserved.
作者
我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。
推荐
暂无数据