Tim Lawson

Jacobian Sparse Autoencoders: Sparsify Computations, Not Just Activations

Feb 25, 2025
Via arXiv

Sparse Autoencoders Can Interpret Randomly Initialized Transformers

Jan 29, 2025
Via arXiv

Residual Stream Analysis with Multi-Layer SAEs

Sep 06, 2024
Via arXiv