Picture for Alan Cooney

Alan Cooney

Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety

Add code
Jul 15, 2025
Viaarxiv icon

RepliBench: Evaluating the autonomous replication capabilities of language model agents

Add code
Apr 21, 2025
Viaarxiv icon

Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs

Add code
Feb 11, 2024
Figure 1 for Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs
Figure 2 for Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs
Figure 3 for Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs
Figure 4 for Summing Up the Facts: Additive Mechanisms Behind Factual Recall in LLMs
Viaarxiv icon