Journal
IEEE TRANSACTIONS ON COMPUTERS
Volume 64, Issue 8, Pages 2382-2393Publisher
IEEE COMPUTER SOC
DOI: 10.1109/TC.2014.2366754
Keywords
Atmospheric modeling; MIC; Stencil; Tianhe-2
Funding
- National Basic Research Program of China [2010CB951903]
- National High-tech R&D 863 Program of China [2013AA01A208]
- Information Technology Program of Chinese Academy of Sciences [XXH12503-02-02-03]
- Natural Science Foundation of China [61361120098, 61170075, 91130023, 61303003, 51190101, 41374113]
- Australia India Strategic Research Grant [AISRF-08140]
- department of industry, Australia
Ask authors/readers for more resources
In this work an ultra-scalable algorithm is designed and optimized to accelerate a 3D compressible Euler atmospheric model on the CPU-MIC hybrid system of Tianhe-2. We first reformulate the mesocale model to avoid long-latency operations, and then employ carefully designed inter-node and intra-node domain decomposition algorithms to achieve balance utilization of different computing units. Proper communication-computation overlap and concurrent data transfer methods are utilized to reduce the cost of data movement at scale. A variety of optimization techniques on both the CPU side and the accelerator side are exploited to enhance the in-socket performance. The proposed hybrid algorithm successfully scales to 6,144 Tianhe-2 nodes with a nearly ideal weak scaling efficiency, and achieve over 8 percent of the peak performance in double precision. This ultra-scalable hybrid algorithm may be of interest to the community to accelerating atmospheric models on increasingly dominated heterogeneous supercomputers.
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available