IntactKV: Improving Large Language Model Quantization by Keeping Pivot Tokens Intact

Mar 02, 2024
Ruikang Liu, Haoli Bai, Haokun Lin, Yuening Li, Han Gao, Zhengzhuo Xu, Lu Hou, Jun Yao, Chun Yuan
