Louis Thomson

Games for AI Control: Models of Safety Evaluations of AI Deployment Protocols

Sep 12, 2024
Via arXiv

Towards shutdownable agents via stochastic choice

Jun 30, 2024
Via arXiv