September 11, 2016
Conference Paper

Combating the Reliability Challenge of GPU Register File at Low Supply Voltage

Abstract

Supply voltage reduction is an effective approach to significantly reduce GPU energy consumption. As the largest on-chip storage structure, the GPU register file becomes the reliability hotspot that prevents further supply voltage reduction below the safe limit (Vmin) due to process variation effects. This work addresses the reliability challenge of the GPU register file at low supply voltages, which is an essential first step for aggressive supply voltage reduction of the entire GPU chip. We propose GR-Guard, an architectural solution that leverages long register dead time to enable reliable operations from unreliable register file at low voltages.

Revised: January 5, 2017 | Published: September 11, 2016

Citation

Tan J., S. Song, K. Yan, X. Fu, A. Marquez, and D.J. Kerbyson. 2016. Combating the Reliability Challenge of GPU Register File at Low Supply Voltage. In Proceedings of the 25th International Conference on Parallel Architectures and Compilation (PACT '16), September 11-15, 2016, Haifa, Israel, 3-15. New York, New York:ACM. PNNL-SA-119484. doi:10.1145/2967938.2967951