David Bau

Gaze Heads: How VLMs Look at What They Describe

Jun 12, 2026
Via arXiv

The Piggyback Hypothesis of Generalization: Explaining and Mitigating Emergent Misalignment

Jun 04, 2026
Via arXiv

The Dual Mechanisms of Spatial Reasoning in Vision-Language Models

Mar 23, 2026
Via arXiv

Agents of Chaos

Feb 23, 2026
Via arXiv

Mechanisms of AI Protein Folding in ESMFold

Feb 05, 2026
Via arXiv

Do explanations generalize across large reasoning models?

Jan 16, 2026
Via arXiv

In-Context Algebra

Dec 18, 2025
Via arXiv

In-Context Learning Without Copying

Nov 07, 2025
Via arXiv

LLMs Process Lists With General Filter Heads

Oct 30, 2025
Via arXiv

LLMs Encode Harmfulness and Refusal Separately

Jul 16, 2025
Via arXiv