Picture for Thomas Read

Thomas Read

Interactions Between Crosscoder Features: A Compact Proofs Perspective

Add code
Jun 08, 2026
Viaarxiv icon

Auditing Games for Sandbagging

Add code
Dec 08, 2025
Viaarxiv icon