Picture for Carissa Cullen

Carissa Cullen

Towards Shutdownable Agents: Generalizing Stochastic Choice in RL Agents and LLMs

Add code
Apr 19, 2026
Viaarxiv icon

Detecting Multi-Agent Collusion Through Multi-Agent Interpretability

Add code
Apr 01, 2026
Viaarxiv icon