Tag-Split Cache for Efficient GPGPU Cache Utilization

June 1, 2016

Conference Paper

Abstract

Modern GPUs employ caches to improve memory system efficiency. However, a large amount of cache space is underutilized because of the irregular memory accesses and poor spatial locality commonly exhibited by GPU applications. Our experiments show that using smaller cache lines can improve cache space utilization, but it also frequently causes significant performance loss by introducing a large number of extra cache requests. In this work, we propose a novel cache design named tag-split cache (TSC) that enables fine-grained cache storage to address the problem of cache space underutilization while keeping the number of memory requests unchanged. TSC divides the tag into two parts to reduce storage overhead, and it supports replacing multiple cache lines in one cycle.
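The trade-off described in the abstract can be illustrated with a small calculation. The following is a minimal sketch, not the paper's experimental setup: the warp size of 32 threads, the 4-byte word size, the 128-byte and 32-byte line sizes, and the two access patterns are all assumptions chosen for illustration, and the helper names `lines_touched` and `summarize` are hypothetical. It counts how many cache lines one warp access touches and what fraction of the fetched bytes is actually used.

```python
def lines_touched(addresses, line_bytes):
    """Return the set of cache-line indices covered by the given byte addresses."""
    return {addr // line_bytes for addr in addresses}


def summarize(addresses, line_bytes, word_bytes=4):
    """Return (number of lines requested, fraction of fetched bytes used).

    Assumes each address is word-aligned, so a word never spans two lines.
    """
    lines = lines_touched(addresses, line_bytes)
    used_bytes = len(set(addresses)) * word_bytes
    fetched_bytes = len(lines) * line_bytes
    return len(lines), used_bytes / fetched_bytes


WARP_SIZE = 32  # assumed number of threads issuing one access together

# Regular pattern: consecutive 4-byte words (good spatial locality).
coalesced = [4 * t for t in range(WARP_SIZE)]
# Irregular pattern: each thread touches a different 128-byte region.
strided = [128 * t for t in range(WARP_SIZE)]

for name, pattern in (("coalesced", coalesced), ("strided", strided)):
    for line_bytes in (128, 32):
        requests, utilization = summarize(pattern, line_bytes)
        print(f"{name:9s} line={line_bytes:3d}B "
              f"requests={requests:2d} utilization={utilization:.3f}")
```

Under these assumptions, shrinking the line from 128 to 32 bytes raises utilization for the strided pattern from about 3% to 12.5%, but it turns the single request of the coalesced pattern into four. This is the tension the abstract describes: finer-grained storage helps irregular accesses, while smaller lines multiply requests for regular ones.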

Revised: September 1, 2016 | Published: June 1, 2016

Citation

Li L., A. Hayes, S. Song, and E. Zhang. 2016. Tag-Split Cache for Efficient GPGPU Cache Utilization. In Proceedings of the International Conference on Supercomputing (ICS 2016), June 1-3, 2016, Istanbul, Turkey, Paper No. 43. New York, New York: ACM. PNNL-SA-117315. doi:10.1145/2925426.2926253