Employing directive based compression solutions on accelerators global memory under OpenACC
Date
2016-05-04
Authors
Salehi, Ebad
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Programmers invest extensive development effort to optimize a GPU program to achieve peak performance. Achieving this requires an efficient usage of global memory, and avoiding memory bandwidth underutilization. The OpenACC programming model has been introduced to tackle the accelerators programming complexity. However, this model’s coarse-grained control on a program can make the memory bandwidth utilization even worse compared to the version written in a native GPU languages such as CUDA. We propose an extension to OpenACC in order to reduce the traffic on the memory interconnection network, using a compression method on floating point numbers. We examine our method on six case studies, and achieve up to 1.36X speedup.
Description
Keywords
Accelerators, OpenACC, Compression