April 1, 2023
Conference Paper

A Framework for Neural Network Inference on FPGA-Centric SmartNICs

Abstract

FPGA-based SmartNICs offer great potential to significantly improve the performance of high-performance computing and warehouse data processing by tightly coupling support for reconfigurable data-intensive computation with cross-node communication, thereby mitigating the von Neumann bottleneck. Existing work, however, has been generally been limited in that it assumes an accelerator model where kernels are offloaded to SmartNICs, but most control tasks are left to the CPUs. This leads to frequent waiting, inferior performance, and scaling challenges. In this work, we propose a new distributive data-centric computing framework, named FCsN, for reconfigurable SmartNIC-based systems. Through a lightweight task circulation execution model and its implementation architecture, FCsN allows the complete detaching of kernel execution, control logic, system scheduling, and network communication to the SmarNICs. This boosts performance by: (i) avoiding the control dependency with CPUs and (ii) supporting streaming kernel execution and network communication at line rate and in a very fine-grained manner. We demonstrate the efficiency and flexibility of FCsN using various types of neural network applications including graph neural networks; as these last are both irregular and data intensive they offer an especially robust demonstration. Evaluations using commonly-used neural network models and graph datasets show that a system with the support of FCsN can achieve, on average, 144 speedups over the MPI-based standard CPU baselines.

Published: April 1, 2023

Citation

Guo A., T. Geng, Y. Zhang, P. Haghi, C. Wu, C. Tan, and Y. Lin, et al. 2022. A Framework for Neural Network Inference on FPGA-Centric SmartNICs. In Proceedings of the 32nd International Conference on Field-Programmable Logic and Applications (FPL 2022), August 29-September 2, 2022, Belfast, United Kingdom, 01-08. Piscataway, New Jersey:IEEE. PNNL-SA-169702. doi:10.1109/FPL57034.2022.00071