Picture for Logan Riggs

Logan Riggs

Decomposition of Small Transformer Models

Add code
Nov 12, 2025
Viaarxiv icon

Decomposing The Dark Matter of Sparse Autoencoders

Add code
Oct 18, 2024
Figure 1 for Decomposing The Dark Matter of Sparse Autoencoders
Figure 2 for Decomposing The Dark Matter of Sparse Autoencoders
Figure 3 for Decomposing The Dark Matter of Sparse Autoencoders
Figure 4 for Decomposing The Dark Matter of Sparse Autoencoders
Viaarxiv icon

Sparse Autoencoders Find Highly Interpretable Features in Language Models

Add code
Sep 19, 2023
Figure 1 for Sparse Autoencoders Find Highly Interpretable Features in Language Models
Figure 2 for Sparse Autoencoders Find Highly Interpretable Features in Language Models
Figure 3 for Sparse Autoencoders Find Highly Interpretable Features in Language Models
Figure 4 for Sparse Autoencoders Find Highly Interpretable Features in Language Models
Viaarxiv icon