Zheng Zhu

Tencent, WeChat Pay

DriveDreamer-Policy: A Geometry-Grounded World-Action Model for Unified Generation and Planning

Apr 02, 2026
Via arXiv

FlashSign: Pose-Free Guidance for Efficient Sign Language Video Generation

Mar 30, 2026
Via arXiv

Vega: Learning to Drive with Natural Language Instructions

Mar 26, 2026
Via arXiv

2K Retrofit: Entropy-Guided Efficient Sparse Refinement for High-Resolution 3D Geometry Prediction

Mar 23, 2026
Via arXiv

GigaWorld-Policy: An Efficient Action-Centered World-Action Model

Mar 18, 2026
Via arXiv

Spectral Defense Against Resource-Targeting Attack in 3D Gaussian Splatting

Mar 13, 2026
Via arXiv

$π$-StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs

Mar 02, 2026
Via arXiv

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Feb 12, 2026
Via arXiv

UniDriveDreamer: A Single-Stage Multimodal World Model for Autonomous Driving

Feb 02, 2026
Via arXiv

Coordinated Pandemic Control with Large Language Model Agents as Policymaking Assistants

Jan 14, 2026
Via arXiv