
Alan Cooney

Async Control: Stress-testing Asynchronous Control Measures for LLM Agents

Dec 15, 2025
Via arXiv

Practical challenges of control monitoring in frontier AI deployments

Dec 15, 2025
Via arXiv

Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety

Jul 15, 2025
Via arXiv

RepliBench: Evaluating the autonomous replication capabilities of language model agents

Apr 21, 2025
Via arXiv

Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs

Feb 11, 2024
Via arXiv