Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge

Dec 09, 2023
Xuan Shen, Peiyan Dong, Lei Lu, Zhenglun Kong, Zhengang Li, Ming Lin, Chao Wu, Yanzhi Wang

View paper on arXiv