Zezhong Qian

MV-WAM: Manifold-Aware World Action Model with Value Augmentation

Jun 19, 2026

WAM-RL: World-Action Model Reinforcement Learning with Reconstruction Rewards and Online Video SFT

Jun 16, 2026

CityGen: Structure-Guided City-Style Synthesis for Cross-City Autonomous Driving

May 28, 2026

HarmoWAM: Harmonizing Generalizable and Precise Manipulation via Adaptive World Action Models

May 11, 2026

Mask World Model: Predicting What Matters for Robust Robot Policy Learning

Apr 22, 2026

OmniForcing: Unleashing Real-time Joint Audio-Visual Generation

Mar 12, 2026

WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation

Oct 08, 2025

WoW: Towards a World omniscient World model Through Embodied Interaction

Sep 26, 2025

DualDiff: Dual-branch Diffusion Model for Autonomous Driving with Semantic Fusion

May 03, 2025

DualDiff+: Dual-Branch Diffusion for High-Fidelity Video Generation with Reward Guidance

Mar 05, 2025