Picture for Hyunji Nam

Hyunji Nam

Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries

Add code
Jul 17, 2025
Figure 1 for Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries
Figure 2 for Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries
Figure 3 for Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries
Figure 4 for Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries
Viaarxiv icon

Predicting Long Term Sequential Policy Value Using Softer Surrogates

Add code
Dec 30, 2024
Viaarxiv icon