Enhancing Computation Efficiency in Large Language Models through Weight and Activation Quantization

Nov 09, 2023
Janghwan Lee, Minsoo Kim, Seungcheol Baek, Seok Joong Hwang, Wonyong Sung, Jungwook Choi
