Codebook Features: Sparse and Discrete Interpretability for Neural Networks

Oct 26, 2023
Alex Tamkin, Mohammad Taufeeque, Noah D. Goodman
