Picture for Andy Q Han

Andy Q Han

How's it going? Reinforcement learning in language models recruits a functional welfare axis

Add code
May 28, 2026
Viaarxiv icon