June 1, 2020
Journal Article

Reliability Analysis for Unreliable FSM Computations

Abstract

Finite State Machines (FSMs) are fundamental in both hardware design and software development. However, the reliability of FSM computations remains poorly understood, a gap that becomes increasingly critical as technology scaling grows more aggressive. Existing reliability analyses are mainly designed for generic computations and do not account for the special error tolerance characteristics of FSM computations. In this work, we introduce RelyFSM – a state-level reliability analysis framework for FSM computations. By qualitatively reasoning about the transition structures in FSM computations, RelyFSM can precisely capture their inherent error tolerance. RelyFSM takes an FSM specification as input and analyzes its reliability for a specific execution. At the core of the analysis is a set of mathematical models that accurately define the behaviors of unreliable FSM executions. Our evaluation with a spectrum of FSM benchmarks confirms the accuracy and efficiency of RelyFSM in analyzing the reliability of real-world FSM computations.

Revised: August 31, 2020 | Published: June 1, 2020

Citation

Sabet A.H., J. Qiu, Z. Zhao, and S. Krishnamoorthy. 2020. Reliability Analysis for Unreliable FSM Computations. ACM Transactions on Architecture and Code Optimization 17, no. 2:Article No. 12. PNNL-SA-150357. doi:10.1145/3377456