Zixi Li

Zhejiang University

Reasoning: From Reflection to Solution

Nov 12, 2025
Via arXiv

Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference Alignment

Jun 24, 2025
Via arXiv