Picture for Zheng Zhu

Zheng Zhu

Tencent, WeChat Pay

ReplicateAnyScene: Zero-Shot Video-to-3D Composition via Textual-Visual-Spatial Alignment

Add code
Apr 12, 2026
Viaarxiv icon

VAG: Dual-Stream Video-Action Generation for Embodied Data Synthesis

Add code
Apr 10, 2026
Viaarxiv icon

ReconPhys: Reconstruct Appearance and Physical Attributes from Single Video

Add code
Apr 09, 2026
Viaarxiv icon

ViVa: A Video-Generative Value Model for Robot Reinforcement Learning

Add code
Apr 09, 2026
Viaarxiv icon

DriveDreamer-Policy: A Geometry-Grounded World-Action Model for Unified Generation and Planning

Add code
Apr 02, 2026
Viaarxiv icon

FlashSign: Pose-Free Guidance for Efficient Sign Language Video Generation

Add code
Mar 30, 2026
Viaarxiv icon

Vega: Learning to Drive with Natural Language Instructions

Add code
Mar 26, 2026
Viaarxiv icon

2K Retrofit: Entropy-Guided Efficient Sparse Refinement for High-Resolution 3D Geometry Prediction

Add code
Mar 23, 2026
Viaarxiv icon

GigaWorld-Policy: An Efficient Action-Centered World--Action Model

Add code
Mar 18, 2026
Viaarxiv icon

Spectral Defense Against Resource-Targeting Attack in 3D Gaussian Splatting

Add code
Mar 13, 2026
Viaarxiv icon