Picture for Amir Zur

Amir Zur

Are language models aware of the road not taken? Token-level uncertainty and hidden state dynamics

Add code
Nov 06, 2025
Viaarxiv icon

MIB: A Mechanistic Interpretability Benchmark

Add code
Apr 17, 2025
Figure 1 for MIB: A Mechanistic Interpretability Benchmark
Figure 2 for MIB: A Mechanistic Interpretability Benchmark
Figure 3 for MIB: A Mechanistic Interpretability Benchmark
Figure 4 for MIB: A Mechanistic Interpretability Benchmark
Viaarxiv icon

Updating CLIP to Prefer Descriptions Over Captions

Add code
Jun 12, 2024
Viaarxiv icon

Causal Proxy Models for Concept-Based Model Explanations

Add code
Sep 28, 2022
Figure 1 for Causal Proxy Models for Concept-Based Model Explanations
Figure 2 for Causal Proxy Models for Concept-Based Model Explanations
Figure 3 for Causal Proxy Models for Concept-Based Model Explanations
Figure 4 for Causal Proxy Models for Concept-Based Model Explanations
Viaarxiv icon