Raja Mehta Moreno

How does information access affect LLM monitors' ability to detect sabotage?

Jan 28, 2026
Via arXiv

CTRL-ALT-DECEIT: Sabotage Evaluations for Automated AI R&D

Nov 18, 2025
Via arXiv