Wan Guanglu

AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models

Jun 08, 2025
Via arXiv