Picture for Tyler Crosse

Tyler Crosse

Attack Selection in Agentic AI Control Evaluations Meaningfully Decreases Safety

Add code
Jun 03, 2026
Viaarxiv icon

When Offline Selectors Cannot Beat the Best Single Model: A Diagnostic Study on edX Dropout Prediction

Add code
Jun 02, 2026
Viaarxiv icon

Asymmetric Goal Drift in Coding Agents Under Value Conflict

Add code
Mar 03, 2026
Viaarxiv icon

Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals

Add code
Mar 03, 2026
Viaarxiv icon