A Novel Mixed-Precision Quantization Approach for CNNs

Model size and inference speed have brought about major challenges for the deployment of Convolutional Neural Networks (CNNs) in many applications. An effective approach to address this issue is model quantization, which achieves network compression and inference speedup by reducing the parameters b...

Full description

Saved in:
Bibliographic Details
Main Authors: Dan Wu, Yanzhi Wang, Yuqi Fei, Guowang Gao
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10929039/
Tags: Add Tag
No Tags, Be the first to tag this record!