Hongyao Liu

Inference-Time Budget Control for LLM Search Agents

May 07, 2026
Via arXiv

SparKV: Overhead-Aware KV Cache Loading for Efficient On-Device LLM Inference

Apr 23, 2026
Via arXiv