Picture for Yixi Li

Yixi Li

Ada-RS: Adaptive Rejection Sampling for Selective Thinking

Add code
Feb 23, 2026
Viaarxiv icon

Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents

Add code
Feb 18, 2026
Viaarxiv icon