Catherine Ge-Wang

Attack Selection in Agentic AI Control Evaluations Meaningfully Decreases Safety

Jun 03, 2026
Via arXiv