Kailing Li

3SPO: State-Score-Supervised Policy Optimization for LLM Agents

Jun 08, 2026
Via arXiv