Picture for Thomas Klein

Thomas Klein

Low-Pass Filtering Improves Behavioral Alignment of Vision Models

Add code
Feb 14, 2026
Viaarxiv icon

MentisOculi: Revealing the Limits of Reasoning with Mental Imagery

Add code
Feb 02, 2026
Viaarxiv icon

How Aligned are Different Alignment Metrics?

Add code
Jul 10, 2024
Figure 1 for How Aligned are Different Alignment Metrics?
Figure 2 for How Aligned are Different Alignment Metrics?
Figure 3 for How Aligned are Different Alignment Metrics?
Figure 4 for How Aligned are Different Alignment Metrics?
Viaarxiv icon

Scale Alone Does not Improve Mechanistic Interpretability in Vision Models

Add code
Jul 11, 2023
Figure 1 for Scale Alone Does not Improve Mechanistic Interpretability in Vision Models
Figure 2 for Scale Alone Does not Improve Mechanistic Interpretability in Vision Models
Figure 3 for Scale Alone Does not Improve Mechanistic Interpretability in Vision Models
Figure 4 for Scale Alone Does not Improve Mechanistic Interpretability in Vision Models
Viaarxiv icon