Callum McDougall

SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability

Mar 13, 2025
Via arXiv

Copy Suppression: Comprehensively Understanding an Attention Head

Oct 06, 2023
Via arXiv