Picture for Yan Huang

Yan Huang

Efficient-WAM: A 1B-Parameter World-Action Model with Low-Cost Future Imagination

Add code
Jun 08, 2026
Viaarxiv icon

WAM-Nav: Asymmetric Latent World-Action Modeling for Unified Visual Navigation

Add code
Jun 03, 2026
Viaarxiv icon

When Seeing Is Not Believing -- A Benchmark for Search-Grounded Video Misinformation Detection

Add code
Jun 02, 2026
Viaarxiv icon

SKIP: Sparse Keyframe Interpolation Paradigm for Efficient Embodied World Models

Add code
May 30, 2026
Viaarxiv icon

PanopticQuery: Unified Query-Time Reasoning for 4D Scenes

Add code
Apr 07, 2026
Viaarxiv icon

Multi-View Video Diffusion Policy: A 3D Spatio-Temporal-Aware Video Action Model

Add code
Apr 03, 2026
Viaarxiv icon

FloorPlan-VLN: A New Paradigm for Floor Plan Guided Vision-Language Navigation

Add code
Mar 18, 2026
Viaarxiv icon

Towards Visual Query Segmentation in the Wild

Add code
Mar 09, 2026
Viaarxiv icon

Towards Long-Form Spatio-Temporal Video Grounding

Add code
Feb 26, 2026
Viaarxiv icon

Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment Localization

Add code
Feb 10, 2026
Viaarxiv icon