Changjiang Han

Preference Heads in Large Language Models: A Mechanistic Framework for Interpretable Personalization

Apr 24, 2026
Via arXiv