Teng Wang

LiveVLN: Breaking the Stop-and-Go Loop in Vision-Language Navigation

Apr 21, 2026
Via arXiv

Chain-of-Glimpse: Search-Guided Progressive Object-Grounded Reasoning for Video Understanding

Apr 16, 2026
Via arXiv

OmniScript: Towards Audio-Visual Script Generation for Long-Form Cinematic Video

Apr 13, 2026
Via arXiv

CycleRL: Sim-to-Real Deep Reinforcement Learning for Robust Autonomous Bicycle Control

Mar 16, 2026
Via arXiv

AutoTraces: Autoregressive Trajectory Forecasting via Multimodal Large Language Models

Mar 09, 2026
Via arXiv

AR2-4FV: Anchored Referring and Re-identification for Long-Term Grounding in Fixed-View Videos

Mar 08, 2026
Via arXiv

PhotoBench: Beyond Visual Matching Towards Personalized Intent-Driven Photo Retrieval

Mar 02, 2026
Via arXiv

UltraStar: Semantic-Aware Star Graph Modeling for Echocardiography Navigation

Mar 02, 2026
Via arXiv

DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories

Feb 11, 2026
Via arXiv

OSCAR: Optimization-Steered Agentic Planning for Composed Image Retrieval

Feb 09, 2026
Via arXiv