Picture for Hui Dong

Hui Dong

VFA: Relieving Vector Operations in Flash Attention with Global Maximum Pre-computation

Add code
Apr 14, 2026
Viaarxiv icon

OSC: Hardware Efficient W4A4 Quantization via Outlier Separation in Channel Dimension

Add code
Apr 14, 2026
Viaarxiv icon