ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment

May 25, 2025


