Alert button

Extreme Compression of Large Language Models via Additive Quantization

Add code
Bookmark button
Alert button
Jan 11, 2024
Vage Egiazarian, Andrei Panferov, Denis Kuznedelev, Elias Frantar, Artem Babenko, Dan Alistarh

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: