Picture for Xueming Han

Xueming Han

Where Do Deep-Research Agents Go Wrong? Span-Level Error Localization in Agent Trajectories

Add code
Jun 01, 2026
Viaarxiv icon

DR$^{3}$-Eval: Towards Realistic and Reproducible Deep Research Evaluation

Add code
Apr 16, 2026
Viaarxiv icon