Picture for Yunsheng Lu

Yunsheng Lu

Optimal Transport for LLM Reward Modeling from Noisy Preference

Add code
May 07, 2026
Viaarxiv icon

Robust Reward Modeling for Large Language Models via Causal Decomposition

Add code
Apr 16, 2026
Viaarxiv icon

A Causal Perspective for Enhancing Jailbreak Attack and Defense

Add code
Jan 31, 2026
Viaarxiv icon