Xiangbo Qi

Fast NF4 Dequantization Kernels for Large Language Model Inference

Apr 02, 2026
Via arXiv