Picture for Thomas Fel

Thomas Fel

ANITI

Cross-Modal Redundancy and the Geometry of Vision-Language Embeddings

Add code
Feb 05, 2026
Viaarxiv icon

Interpreting Physics in Video World Models

Add code
Feb 04, 2026
Viaarxiv icon

Block-Recurrent Dynamics in Vision Transformers

Add code
Dec 23, 2025
Figure 1 for Block-Recurrent Dynamics in Vision Transformers
Figure 2 for Block-Recurrent Dynamics in Vision Transformers
Figure 3 for Block-Recurrent Dynamics in Vision Transformers
Figure 4 for Block-Recurrent Dynamics in Vision Transformers
Viaarxiv icon

Back to the Baseline: Examining Baseline Effects on Explainability Metrics

Add code
Dec 12, 2025
Figure 1 for Back to the Baseline: Examining Baseline Effects on Explainability Metrics
Figure 2 for Back to the Baseline: Examining Baseline Effects on Explainability Metrics
Figure 3 for Back to the Baseline: Examining Baseline Effects on Explainability Metrics
Figure 4 for Back to the Baseline: Examining Baseline Effects on Explainability Metrics
Viaarxiv icon

A Geometric Unification of Concept Learning with Concept Cones

Add code
Dec 08, 2025
Viaarxiv icon

Uncovering Conceptual Blindspots in Generative Image Models Using Sparse Autoencoders

Add code
Jun 24, 2025
Viaarxiv icon

Evaluating Sparse Autoencoders: From Shallow Design to Matching Pursuit

Add code
Jun 05, 2025
Figure 1 for Evaluating Sparse Autoencoders: From Shallow Design to Matching Pursuit
Figure 2 for Evaluating Sparse Autoencoders: From Shallow Design to Matching Pursuit
Figure 3 for Evaluating Sparse Autoencoders: From Shallow Design to Matching Pursuit
Figure 4 for Evaluating Sparse Autoencoders: From Shallow Design to Matching Pursuit
Viaarxiv icon

Interpreting the Linear Structure of Vision-language Model Embedding Spaces

Add code
Apr 16, 2025
Figure 1 for Interpreting the Linear Structure of Vision-language Model Embedding Spaces
Figure 2 for Interpreting the Linear Structure of Vision-language Model Embedding Spaces
Figure 3 for Interpreting the Linear Structure of Vision-language Model Embedding Spaces
Figure 4 for Interpreting the Linear Structure of Vision-language Model Embedding Spaces
Viaarxiv icon

Projecting Assumptions: The Duality Between Sparse Autoencoders and Concept Geometry

Add code
Mar 03, 2025
Figure 1 for Projecting Assumptions: The Duality Between Sparse Autoencoders and Concept Geometry
Figure 2 for Projecting Assumptions: The Duality Between Sparse Autoencoders and Concept Geometry
Figure 3 for Projecting Assumptions: The Duality Between Sparse Autoencoders and Concept Geometry
Figure 4 for Projecting Assumptions: The Duality Between Sparse Autoencoders and Concept Geometry
Viaarxiv icon

Archetypal SAE: Adaptive and Stable Dictionary Learning for Concept Extraction in Large Vision Models

Add code
Feb 18, 2025
Viaarxiv icon