Picture for Dengzhe Hou

Dengzhe Hou

Beyond Completion: Probing Cumulative State Tracking to Predict LLM Agent Performance

Add code
Mar 28, 2026
Viaarxiv icon