Zefan Yu

BenchTrace: A Benchmark for Testing Reflection Ability and Controlled Evolution in LLM Agents

May 28, 2026
Via arXiv