Amir Zur

Agents of Chaos

Feb 23, 2026

Surgical Activation Steering via Generative Causal Mediation

Feb 17, 2026

Are language models aware of the road not taken? Token-level uncertainty and hidden state dynamics

Nov 06, 2025

MIB: A Mechanistic Interpretability Benchmark

Apr 17, 2025

Updating CLIP to Prefer Descriptions Over Captions

Jun 12, 2024

Causal Proxy Models for Concept-Based Model Explanations

Sep 28, 2022