Hyunji Nam

Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries

Jul 17, 2025
Via arXiv

Predicting Long Term Sequential Policy Value Using Softer Surrogates

Dec 30, 2024
Via arXiv