Zelin Tan

Select-then-Solve: Paradigm Routing as Inference-Time Optimization for LLM Agents

Apr 08, 2026
Via arXiv

Stabilizing Rubric Integration Training via Decoupled Advantage Normalization

Mar 27, 2026
Via arXiv