Picture for Xiangrui Yu

Xiangrui Yu

ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression

Add code
Mar 18, 2026
Viaarxiv icon

ERNIE 5.0 Technical Report

Add code
Feb 04, 2026
Viaarxiv icon