Picture for Jiesong Lian

Jiesong Lian

SoliReward: Mitigating Susceptibility to Reward Hacking and Annotation Noise in Video Generation Reward Models

Add code
Dec 17, 2025
Viaarxiv icon

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles

Add code
Jun 03, 2024
Viaarxiv icon