A Novel Mixed-Precision Quantization Approach for CNNs

Model size and inference speed have brought about major challenges for the deployment of Convolutional Neural Networks (CNNs) in many applications. An effective approach to address this issue is model quantization, which achieves network compression and inference speedup by reducing the parameters b...

Full description

Saved in:

Bibliographic Details
Main Authors:	Dan Wu, Yanzhi Wang, Yuqi Fei, Guowang Gao
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Convolutional neural network mixed precision quantization model compression second-order information
Online Access:	https://ieeexplore.ieee.org/document/10929039/
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://ieeexplore.ieee.org/document/10929039/

A Novel Mixed-Precision Quantization Approach for CNNs

Internet

Similar Items