Addressing Activation Outliers in LLMs: A Systematic Review of Post-Training Quantization Techniques

Large Language Models (LLMs) have transformed natural language processing, yet their deployment remains challenging due to substantial computational, memory, and energy demands. Post-training quantization has emerged as a key strategy for enabling efficient inference, particularly in resource-constr...

Full description

Saved in:
Bibliographic Details
Main Authors: Patrik Czako, Gabor Kertesz, Sandor Szenasi
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10994764/
Tags: Add Tag
No Tags, Be the first to tag this record!