Picture for Max Loeffler

Max Loeffler

Anatomy of Post-Training: Using Interpretability to Characterize Data and Shape the Learning Signal

Add code
Jun 10, 2026
Viaarxiv icon

Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought

Add code
Mar 05, 2026
Viaarxiv icon