Haoang Li

DualCoT-VLA: Visual-Linguistic Chain of Thought via Parallel Reasoning for Vision-Language-Action Models

Mar 23, 2026
Via arXiv

DyGeoVLN: Infusing Dynamic Geometry Foundation Model into Vision-Language Navigation

Mar 22, 2026
Via arXiv

VAMPO: Policy Optimization for Improving Visual Dynamics in Video Action Models

Mar 19, 2026
Via arXiv

AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation

Mar 18, 2026
Via arXiv

P$^{3}$Nav: End-to-End Perception, Prediction and Planning for Vision-and-Language Navigation

Mar 18, 2026
Via arXiv

S-VAM: Shortcut Video-Action Model by Self-Distilling Geometric and Semantic Foresight

Mar 17, 2026
Via arXiv

Global Truncated Loss Minimization for Robust and Threshold-Resilient Geometric Estimation

Mar 16, 2026
Via arXiv

A Unified Calibration Framework for Coordinate and Kinematic Parameters in Dual-Arm Robots

Mar 16, 2026
Via arXiv

RegFormer++: An Efficient Large-Scale 3D LiDAR Point Registration Network with Projection-Aware 2D Transformer

Mar 15, 2026
Via arXiv

Omni-Manip: Beyond-FOV Large-Workspace Humanoid Manipulation with Omnidirectional 3D Perception

Mar 05, 2026
Via arXiv