Picture for Morgan Sinclaire

Morgan Sinclaire

When can we trust untrusted monitoring? A safety case sketch across collusion strategies

Add code
Feb 24, 2026
Viaarxiv icon