Picture for YiMing Cheng

YiMing Cheng

VQ-Logits: Compressing the Output Bottleneck of Large Language Models via Vector Quantized Logits

Add code
May 15, 2025
Viaarxiv icon