Picture for Tianlong Nan

Tianlong Nan

Efficient Exploration for Iterative Nash Preference Optimization

Add code
May 31, 2026
Viaarxiv icon