Alert button
Picture for Stefan Heimersheim

Stefan Heimersheim

Alert button

How to use and interpret activation patching

Add code
Bookmark button
Alert button
Apr 23, 2024
Stefan Heimersheim, Neel Nanda

Viaarxiv icon

Towards Automated Circuit Discovery for Mechanistic Interpretability

Add code
Bookmark button
Alert button
Apr 28, 2023
Arthur Conmy, Augustine N. Mavor-Parker, Aengus Lynch, Stefan Heimersheim, Adrià Garriga-Alonso

Figure 1 for Towards Automated Circuit Discovery for Mechanistic Interpretability
Figure 2 for Towards Automated Circuit Discovery for Mechanistic Interpretability
Figure 3 for Towards Automated Circuit Discovery for Mechanistic Interpretability
Figure 4 for Towards Automated Circuit Discovery for Mechanistic Interpretability
Viaarxiv icon