Picture for Zixi Li

Zixi Li

Zhejiang University

Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference Alignment

Add code
Jun 24, 2025
Viaarxiv icon