A Novel Mixed-Precision Quantization Approach for CNNs
Model size and inference speed pose major challenges for deploying Convolutional Neural Networks (CNNs) in many applications. An effective approach to address this issue is model quantization, which achieves network compression and inference speedup by reducing the parameters b...
| Main Authors: | |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | IEEE, 2025-01-01 |
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10929039/ |