Picture for Hyunji Nam

Hyunji Nam

Pigeonholing: Bad prompts hurt models to collapse and make mistakes

Add code
Jun 23, 2026
Viaarxiv icon

Mitigating LLM biases toward spurious social contexts using direct preference optimization

Add code
Apr 02, 2026
Viaarxiv icon

Netflix Artwork Personalization via LLM Post-training

Add code
Jan 06, 2026
Viaarxiv icon

Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries

Add code
Jul 17, 2025
Figure 1 for Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries
Figure 2 for Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries
Figure 3 for Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries
Figure 4 for Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries
Viaarxiv icon

Predicting Long Term Sequential Policy Value Using Softer Surrogates

Add code
Dec 30, 2024
Viaarxiv icon