Alert button

FineQuant: Unlocking Efficiency with Fine-Grained Weight-Only Quantization for LLMs

Add code
Bookmark button
Alert button
Aug 16, 2023
Young Jin Kim, Rawn Henry, Raffy Fahim, Hany Hassan Awadalla

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: