Picture for Zhiheng Ma

Zhiheng Ma

Learning Action Manifold with Multi-view Latent Priors for Robotic Manipulation

Add code
May 12, 2026
Viaarxiv icon

Beyond World-Frame Action Heads: Motion-Centric Action Frames for Vision-Language-Action Models

Add code
May 12, 2026
Viaarxiv icon

Retrieve-then-Steer: Online Success Memory for Test-Time Adaptation of Generative VLAs

Add code
May 11, 2026
Viaarxiv icon

ALAM: Algebraically Consistent Latent Transitions for Vision-Language-Action Models

Add code
May 11, 2026
Viaarxiv icon

Continuous Expert Assembly: Instance-Conditioned Low-Rank Residuals for All-in-One Image Restoration

Add code
May 07, 2026
Viaarxiv icon

ABot-Claw: A Foundation for Persistent, Cooperative, and Self-Evolving Robotic Agents

Add code
Apr 11, 2026
Viaarxiv icon

ABot-PhysWorld: Interactive World Foundation Model for Robotic Manipulation with Physics Alignment

Add code
Mar 24, 2026
Viaarxiv icon

HumanOmni-Speaker: Identifying Who said What and When

Add code
Mar 23, 2026
Viaarxiv icon

Trajectory-Diversity-Driven Robust Vision-and-Language Navigation

Add code
Mar 16, 2026
Viaarxiv icon

Neural Implicit Action Fields: From Discrete Waypoints to Continuous Functions for Vision-Language-Action Models

Add code
Mar 02, 2026
Viaarxiv icon