Picture for Yuxuan Yue

Yuxuan Yue

RWKVQuant: Quantizing the RWKV Family with Proxy Guided Hybrid of Scalar and Vector Quantization

Add code
May 02, 2025
Viaarxiv icon

WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More

Add code
Feb 20, 2024
Viaarxiv icon