Picture for Robert Graham

Robert Graham

Moral Preferences of LLMs Under Directed Contextual Influence

Add code
Feb 26, 2026
Viaarxiv icon

Prisma: An Open Source Toolkit for Mechanistic Interpretability in Vision and Video

Add code
Apr 28, 2025
Viaarxiv icon

Steering CLIP's vision transformer with sparse autoencoders

Add code
Apr 11, 2025
Figure 1 for Steering CLIP's vision transformer with sparse autoencoders
Figure 2 for Steering CLIP's vision transformer with sparse autoencoders
Figure 3 for Steering CLIP's vision transformer with sparse autoencoders
Figure 4 for Steering CLIP's vision transformer with sparse autoencoders
Viaarxiv icon