Picture for Benjamin Hadad

Benjamin Hadad

Attack Selection in Agentic AI Control Evaluations Meaningfully Decreases Safety

Add code
Jun 03, 2026
Viaarxiv icon