Picture for Wan Guanglu

Wan Guanglu

AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models

Add code
Jun 08, 2025
Figure 1 for AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models
Figure 2 for AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models
Figure 3 for AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models
Figure 4 for AMoPO: Adaptive Multi-objective Preference Optimization without Reward Models and Reference Models
Viaarxiv icon