Picture for Ivgeni Segal

Ivgeni Segal

Hardening Agent Benchmarks with Adversarial Hacker-Fixer Loops

Add code
Jun 08, 2026
Viaarxiv icon

Terminal Wrench: A Dataset of 331 Reward-Hackable Environments and 3,632 Exploit Trajectories

Add code
Apr 19, 2026
Viaarxiv icon