Picture for Martin Wattenberg

Martin Wattenberg

Tensor Product Representation Probes Reveal Shared Structure Across Linear Directions

Add code
May 11, 2026
Viaarxiv icon

Intentmaking and Sensemaking: Human Interaction with AI-Guided Mathematical Discovery

Add code
May 07, 2026
Viaarxiv icon

AI Co-Mathematician: Accelerating Mathematicians with Agentic AI

Add code
May 07, 2026
Viaarxiv icon

Large Language Models Generate Harmful Content Using a Distinct, Unified Mechanism

Add code
Apr 10, 2026
Viaarxiv icon

Think Before You Lie: How Reasoning Leads to Honesty

Add code
Mar 16, 2026
Viaarxiv icon

Think Before You Lie: How Reasoning Improves Honesty

Add code
Mar 10, 2026
Viaarxiv icon

Decomposing Query-Key Feature Interactions Using Contrastive Covariances

Add code
Feb 04, 2026
Viaarxiv icon

Does visualization help AI understand data?

Add code
Jul 24, 2025
Figure 1 for Does visualization help AI understand data?
Figure 2 for Does visualization help AI understand data?
Figure 3 for Does visualization help AI understand data?
Figure 4 for Does visualization help AI understand data?
Viaarxiv icon

Can Interpretation Predict Behavior on Unseen Data?

Add code
Jul 08, 2025
Viaarxiv icon

When Bad Data Leads to Good Models

Add code
May 07, 2025
Figure 1 for When Bad Data Leads to Good Models
Figure 2 for When Bad Data Leads to Good Models
Figure 3 for When Bad Data Leads to Good Models
Figure 4 for When Bad Data Leads to Good Models
Viaarxiv icon