A novel strategy for compact finite difference evaluation on gpu-accelerated clusters


Abstract eng:
In this work, we present a novel strategy to solve the tridiagonal systems arising in such numerical schemes as compact finite differences and alternating direction implicit methods on graphics processing units (GPUs). We demonstrate the impact of the simple matrix structure on the cyclic reduction algorithm, and show that precomputation of coefficients appearing in the algorithm becomes feasible and efficient. We demonstrate that an implementation using our approach is able to outperform the NVIDIA CUSPARSE and multithreaded Intel MKL solvers on GPU and CPU, respectively. We apply this tridiagonal solver to the solution of compact finite differences on multiple GPUs distributed in a cluster, and show scaling for up to 64 GPUs.

Publisher:
International Union of Theoretical and Applied Mechanics, 2016
Conference Title:
Conference Title:
24th International Congress of Theoretical and Applied Mechanics
Conference Venue:
Montreal (CA)
Conference Dates:
2016-08-21 / 2016-08-26
Rights:
Text je chráněný podle autorského zákona č. 121/2000 Sb.



Record appears in:



 Record created 2016-11-15, last modified 2016-11-15


Original version of the author's contribution as presented on CD, XMLout( page 3121, code PO.FS02-1.03.339).:
Download fulltext
PDF

Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)