WAPS-Quant: Low-Bit Post-Training Quantization Using Weight-Activation Product Scaling
Post-Training Quantization (PTQ) effectively compresses neural networks to very few bits using only a limited calibration dataset. Various quantization methods that utilize second-order error have been proposed and have demonstrated good performance. However, at extremely low bits, the increase in qu...
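Since this record carries only a truncated abstract, the snippet below is a minimal, generic sketch of calibration-based low-bit PTQ in Python, not the paper's actual WAPS-Quant algorithm. The grid search over candidate scales, the weight-activation product-error objective, and the function names (`quantize`, `calibrate_scale`) are assumptions introduced here purely for illustration.

```python
# Minimal, generic sketch of low-bit post-training weight quantization with a
# per-channel scale chosen on calibration activations. This is NOT the paper's
# WAPS-Quant method (its details are not in this record); the grid search and
# the product-error objective below are illustrative assumptions.
import numpy as np

def quantize(w, scale, n_bits=2):
    """Uniform symmetric quantization of weights w with a given scale."""
    qmax = 2 ** (n_bits - 1) - 1
    q = np.clip(np.round(w / scale), -qmax - 1, qmax)
    return q * scale

def calibrate_scale(w_row, x_calib, n_bits=2, n_grid=80):
    """Pick the scale minimizing the weight-activation product error
    ||X w - X w_q||^2 on a small calibration batch (an assumed objective)."""
    best_scale, best_err = None, np.inf
    max_abs = np.abs(w_row).max() + 1e-12
    for s in np.linspace(0.2 * max_abs, max_abs, n_grid):
        w_q = quantize(w_row, s, n_bits)
        err = np.sum((x_calib @ w_row - x_calib @ w_q) ** 2)
        if err < best_err:
            best_scale, best_err = s, err
    return best_scale

# Usage with random data standing in for one output channel of a linear layer
# and a small calibration batch.
rng = np.random.default_rng(0)
w = rng.normal(size=(128,))        # one output channel's weights
x = rng.normal(size=(32, 128))     # 32 calibration samples
s = calibrate_scale(w, x, n_bits=2)
w_q = quantize(w, s, n_bits=2)
print("scale:", s, "product error:", np.sum((x @ w - x @ w_q) ** 2))
```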
| Main Authors: | Geunjae Choi, Kamin Lee, Nojun Kwak |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | IEEE, 2025-01-01 |
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10982219/ |
Similar Items
- COMQ: A Backpropagation-Free Algorithm for Post-Training Quantization
  by: Aozhong Zhang, et al.
  Published: (2025-01-01)
- Evaluation of Information Indicators of Quantization Noise Generated by Speech Signal Analogue-Digital Conversion
  by: V. K. Zheleznyak, et al.
  Published: (2023-04-01)
- Synthesis of a measuring composite signal for assessing the security of speech signals during discrete-quantized transformation
  by: V. K. Zheleznjak, et al.
  Published: (2020-10-01)
- Burau representation of $B_4$ and quantization of the rational projective plane
  by: Jouteur, Perrine
  Published: (2025-03-01)
- Conditional Optimal Sets and the Quantization Coefficients for Some Uniform Distributions
  by: Evans Nyanney, et al.
  Published: (2025-07-01)