Picture for Lauren Robson

Lauren Robson

CTRL-ALT-DECEIT: Sabotage Evaluations for Automated AI R&D

Add code
Nov 18, 2025
Viaarxiv icon