Andres Saurez

Circuit Fingerprints: How Answer Tokens Encode Their Geometrical Path

Feb 10, 2026
Via arXiv

Why Linear Interpretability Works: Invariant Subspaces as a Result of Architectural Constraints

Feb 10, 2026
Via arXiv