Ruibo Fan

ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression

Mar 18, 2026
Via arXiv

Dissecting Outlier Dynamics in LLM NVFP4 Pretraining

Feb 02, 2026
Via arXiv

Dissecting the Runtime Performance of the Training, Fine-tuning, and Inference of Large Language Models

Nov 07, 2023
Via arXiv