Christopher Potts

Shammie

Invisible failures in human-AI interactions

Mar 16, 2026

Transcoder Adapters for Reasoning-Model Diffing

Feb 24, 2026

Counterfactual Simulation Training for Chain-of-Thought Faithfulness

Feb 24, 2026

Language models as tools for investigating the distinction between possible and impossible natural languages

Dec 10, 2025

Addressing divergent representations from causal interventions on neural networks

Nov 06, 2025

Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs

Aug 06, 2025

Improved Representation Steering for Language Models

May 27, 2025

CHART-6: Human-Centered Evaluation of Data Visualization Understanding in Vision-Language Models

May 22, 2025

Causal Interventions Reveal Shared Structure Across English Filler-Gap Constructions

May 21, 2025

Mechanistic evaluation of Transformers and state space models

May 21, 2025