Zhennan Jiang

Posterior Optimization with Clipped Objective for Bridging Efficiency and Stability in Generative Policy Learning

Apr 02, 2026
Via arXiv

WoVR: World Models as Reliable Simulators for Post-Training VLA Policies with RL

Feb 15, 2026
Via arXiv

TeViR: Text-to-Video Reward with Diffusion Models for Efficient Reinforcement Learning

May 26, 2025
Via arXiv