Picture for XingXing Wei

XingXing Wei

Improving Safety Alignment via Balanced Direct Preference Optimization

Add code
Mar 24, 2026
Viaarxiv icon