Picture for Zezhi Liu

Zezhi Liu

VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model

Add code
Feb 14, 2026
Viaarxiv icon

A General One-Shot Multimodal Active Perception Framework for Robotic Manipulation: Learning to Predict Optimal Viewpoint

Add code
Jan 20, 2026
Viaarxiv icon