With the advance of contemporary computer architectures, there is a continuously growing performance gap between processors and memories. Data compression is a promising approach to resolve this issue by reducing the data size. On general-purpose graphics processing unit (GPGPU), many parallel computing applications have large amounts of floating-point data. In this paper, we propose an innovative compression algorithm with low latency by exploiting regular data patterns. By integrating the proposed architecture with the memory controller, the GPU can averagely reduce 44.46% of the memory bandwidth usage, and reduce 44.34% of the energy consumption.
All Science Journal Classification (ASJC) codes