Picture for Zheng Zhu

Zheng Zhu

Tencent, WeChat Pay

Vega: Learning to Drive with Natural Language Instructions

Add code
Mar 26, 2026
Viaarxiv icon

2K Retrofit: Entropy-Guided Efficient Sparse Refinement for High-Resolution 3D Geometry Prediction

Add code
Mar 23, 2026
Viaarxiv icon

GigaWorld-Policy: An Efficient Action-Centered World--Action Model

Add code
Mar 18, 2026
Viaarxiv icon

Spectral Defense Against Resource-Targeting Attack in 3D Gaussian Splatting

Add code
Mar 13, 2026
Viaarxiv icon

$π$-StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs

Add code
Mar 02, 2026
Viaarxiv icon

GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning

Add code
Feb 12, 2026
Viaarxiv icon

UniDriveDreamer: A Single-Stage Multimodal World Model for Autonomous Driving

Add code
Feb 02, 2026
Viaarxiv icon

Coordinated Pandemic Control with Large Language Model Agents as Policymaking Assistants

Add code
Jan 14, 2026
Viaarxiv icon

Spatial Multi-Task Learning for Breast Cancer Molecular Subtype Prediction from Single-Phase DCE-MRI

Add code
Jan 11, 2026
Viaarxiv icon

TokenSeg: Efficient 3D Medical Image Segmentation via Hierarchical Visual Token Compression

Add code
Jan 08, 2026
Viaarxiv icon