Picture for Hao Dong

Hao Dong

CorrectNav: Self-Correction Flywheel Empowers Vision-Language-Action Navigation Model

Add code
Aug 14, 2025
Viaarxiv icon

Adapting Vision-Language Models Without Labels: A Comprehensive Survey

Add code
Aug 07, 2025
Viaarxiv icon

ClutterDexGrasp: A Sim-to-Real System for General Dexterous Grasping in Cluttered Scenes

Add code
Jun 17, 2025
Viaarxiv icon

CheckManual: A New Challenge and Benchmark for Manual-based Appliance Manipulation

Add code
Jun 11, 2025
Viaarxiv icon

SR3D: Unleashing Single-view 3D Reconstruction for Transparent and Specular Object Grasping

Add code
May 30, 2025
Viaarxiv icon

To Trust Or Not To Trust Your Vision-Language Model's Prediction

Add code
May 29, 2025
Viaarxiv icon

From Strangers to Assistants: Fast Desire Alignment for Embodied Agent-User Adaptation

Add code
May 28, 2025
Viaarxiv icon

SpikeStereoNet: A Brain-Inspired Framework for Stereo Depth Estimation from Spike Streams

Add code
May 26, 2025
Viaarxiv icon

Extremely Simple Multimodal Outlier Synthesis for Out-of-Distribution Detection and Segmentation

Add code
May 22, 2025
Viaarxiv icon

GCAL: Adapting Graph Models to Evolving Domain Shifts

Add code
May 22, 2025
Viaarxiv icon