4.7 Article

The Dynamic Kernel Scheduler-Part 1

Journal

COMPUTER PHYSICS COMMUNICATIONS
Volume 207, Issue -, Pages 83-90

Publisher

ELSEVIER
DOI: 10.1016/j.cpc.2016.05.013

Keywords

GPU; CUDA; Intel MIC; FFT; Monte Carlo; OPAL; mu SR

Ask authors/readers for more resources

Emerging processor architectures such as GPUs and Intel MICs provide a huge performance potential for high, performance computing. However developing software that uses these hardware accelerators introduces additional challenges for the developer. These challenges may include exposing increased parallelism, handling different hardware designs, and using multiple development frameworks in order to utilise devices from different vendors. The Dynamic Kernel Scheduler (DIGS) is being developed in order to provide a software layer between the host application and different hardware accelerators. DKS handles the communication between the host and the device, schedules task execution, and provides a library of built-in algorithms. Algorithms available in the DKS library will be written in CUDA, OpenCL, and OpenMP. Depending on the available hardware, the DKS can select the appropriate implementation of the algorithm. The first DKS version was created using CUDA for the Nvidia GPUs and OpenMP for Intel MIC. DKS was further integrated into OPAL (Object-oriented Parallel Accelerator Library) in order to speed up a parallel EFT based Poisson solver and Monte Carlo simulations for particle matter interaction used for proton therapy degrader modelling. DKS was also used together with Minuit2 for parameter fitting, where x(2) and max-log-likelihood functions were offloaded to the hardware accelerator. The concepts of the DKS, first results, and plans for the future will be shown in this paper. (C) 2016 Elsevier B.V. All rights reserved.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available