Jierui Zuo

DDO-RM for LLM Preference Optimization: A Minimal Held-Out Benchmark against DPO

Apr 13, 2026
Via arXiv

On Pareto Optimality for the Multinomial Logistic Bandit

Jan 31, 2025
Via arXiv