Picture for Kexin Chu

Kexin Chu

Latency-Quality Routing for Functionally Equivalent Tools in LLM Agents

Add code
May 14, 2026
Viaarxiv icon

From Stateless Queries to Autonomous Actions: A Layered Security Framework for Agentic AI Systems

Add code
Apr 25, 2026
Viaarxiv icon

Dynamic Expert Quantization for Scalable Mixture-of-Experts Inference

Add code
Nov 19, 2025
Viaarxiv icon

ExpertFlow: Adaptive Expert Scheduling and Memory Coordination for Efficient MoE Inference

Add code
Oct 30, 2025
Viaarxiv icon

PromptSculptor: Multi-Agent Based Text-to-Image Prompt Optimization

Add code
Sep 15, 2025
Figure 1 for PromptSculptor: Multi-Agent Based Text-to-Image Prompt Optimization
Figure 2 for PromptSculptor: Multi-Agent Based Text-to-Image Prompt Optimization
Figure 3 for PromptSculptor: Multi-Agent Based Text-to-Image Prompt Optimization
Figure 4 for PromptSculptor: Multi-Agent Based Text-to-Image Prompt Optimization
Viaarxiv icon

Selective KV-Cache Sharing to Mitigate Timing Side-Channels in LLM Inference

Add code
Aug 11, 2025
Viaarxiv icon