Tyler Tracy

MonitoringBench: Semi-Automated Red-Teaming for Agent Monitoring

May 10, 2026

Attack Selection Reduces Safety in Concentrated AI Control Settings against Trusted Monitoring

Feb 04, 2026

BashArena: A Control Setting for Highly Privileged AI Agents

Dec 17, 2025

Ctrl-Z: Controlling AI Agents via Resampling

Apr 14, 2025