Picture for Jiayi Ren

Jiayi Ren

LatencyPrism: Online Non-intrusive Latency Sculpting for SLO-Guaranteed LLM Inference

Add code
Jan 14, 2026
Viaarxiv icon

Kunlun Anomaly Troubleshooter: Enabling Kernel-Level Anomaly Detection and Causal Reasoning for Large Model Distributed Inference

Add code
Nov 08, 2025
Viaarxiv icon