Fangzhi Zhu

MPPO: Multi Pair-wise Preference Optimization for LLMs with Arbitrary Negative Samples

Dec 13, 2024
Via arXiv