Picture for Xuejian Gong

Xuejian Gong

HMI: Hierarchical Knowledge Management for Efficient Multi-Tenant Inference in Pretrained Language Models

Add code
Apr 24, 2025
Viaarxiv icon

Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models

Add code
Feb 19, 2025
Viaarxiv icon