Tunable Energy-Efficient Approximate Circuits for Self-Powered AI and Autonomous Edge Computing Systems

Artificial Intelligence is applied in various domains of compute-intensive applications ranging from image recognition healthcare to statistical analysis. Additionally, recent advancements in Deep Neural Network (DNN) running on millions of devices for various AI tasks deliver an accuracy comparable...

Full description

Saved in:
Bibliographic Details
Main Authors: Shubham Garg, Kanika Monga, Nitin Chaturvedi, S. Gurunarayanan
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10912512/
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Artificial Intelligence is applied in various domains of compute-intensive applications ranging from image recognition healthcare to statistical analysis. Additionally, recent advancements in Deep Neural Network (DNN) running on millions of devices for various AI tasks deliver an accuracy comparable to human levels. However, the accuracy in computations came with an additional cost of increased computational resources and power consumption in traditional computing units. Moreover, this problem becomes more complex while deploying computationally intensive heavy machine learning (ML) models on energy-constrained edge devices. Approximate computing has emerged as a promising paradigm for error-tolerant AI/ML applications deployed on energy-constrained edge devices where the complexity of hardware computing units can be reduced by optimizing circuit logic while slightly trading off computational accuracy. Therefore, we propose novel approximate compressors to design multiply and accumulate (MAC) hardware unit of Deep Neural Network and Convolutional Neural Network (DNN/CNN) that achieve energy-efficient and faster computations with slightly reduced precision. We also propose tunable compressors and MAC unit that support switching between two approximation modes to enable runtime adjustment of energy efficiency and accuracy for energy-autonomous edge devices. We validated and verified the design of the proposed approximate circuit at 7nm and 55nm technology nodes. The simulation result for the tunable compressor shows an average reduction of 49 % in energy consumption, and 30 % in delay compared to the state-of-the-art compressor. In addition, an average reduction of 36 % in energy consumption and 18 % in delay was observed for the MAC unit compared with the conventional MAC. To analyze the impact on the accuracy of the final output, we evaluated the error matrices for the approximate MAC unit and observed a low MRED (mean relative error distance) of 0.19. Furthermore, we implemented a simple CNN for pattern detection and integrated our proposed solution to validate its benefits, achieving 15% lower power consumption and 7% lower area overhead without functionality loss.
ISSN:2169-3536