Convolution Smooth: A Post-Training Quantization Method for Convolutional Neural Networks
Convolutional neural network (CNN) quantization is an efficient model compression technique primarily used for accelerating inference and optimizing resources. However, existing methods often apply different quantization strategies to activations and weights, without considering their interplay. To...
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | IEEE, 2025-01-01 |
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10955493/ |