December 22, 2020
Conference Paper

CQNN: a CGRA-based QNN Framework

Abstract

We propose a novel Coarse-Grained Reconfigurable Architecture-based (CGRA) QNN acceleration framework, CQNN. CQNN has a large number of basic components for binary functions. By programming CQNN at runtime according to the target QNN models, these basic components are integrated efficiently to support QNN functions with any data-width and hyper-parameter requirements and CQNN is reconfigured to have the optimal architecture for the target models. The framework includes compiler, architecture design, simulator and RTL generator. Experimental results show CQNNs can complete the inference of AlexNet and VGG-16 within 0.13ms and 2.63ms.

Revised: January 29, 2021 | Published: December 22, 2020

Citation

Geng T., C. Wu, C. Tan, B. Fang, A. Li, and M. Herbordt. 2020. CQNN: a CGRA-based QNN Framework. In IEEE High Performance Extreme Computing Conference (HPEC 2020), September 22-24, 2020, Waltham, MA, 1-7. Piscataway, New Jersey:IEEE. PNNL-SA-153940. doi:10.1109/HPEC43674.2020.9286194