Picture for Connor Watts

Connor Watts

Features as Rewards: Scalable Supervision for Open-Ended Tasks via Interpretability

Add code
Feb 11, 2026
Viaarxiv icon

Detecting and Characterizing Planning in Language Models

Add code
Aug 25, 2025
Viaarxiv icon