Zhiyu Lu

SEA-Eval: A Benchmark for Evaluating Self-Evolving Agents Beyond Episodic Assessment

Apr 14, 2026
Via arXiv