Haoming Song

Ego to World: Collaborative Spatial Reasoning in Embodied Systems via Reinforcement Learning

Mar 16, 2026

ZeroWBC: Learning Natural Visuomotor Humanoid Control Directly from Human Egocentric Video

Mar 10, 2026

PocketDP3: Efficient Pocket-Scale 3D Visuomotor Policy

Jan 29, 2026

Information Filtering via Variational Regularization for Robot Manipulation

Jan 29, 2026

Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge

Jan 26, 2026

Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation

Dec 11, 2025

FastUMI-100K: Advancing Data-driven Robotic Manipulation with a Large-scale UMI-style Dataset

Oct 09, 2025

Trajectory Conditioned Cross-embodiment Skill Transfer

Oct 09, 2025

F1: A Vision-Language-Action Model Bridging Understanding and Generation to Actions

Sep 09, 2025

EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control

Aug 28, 2025