Picture for Huchuan Lu

Huchuan Lu

Ego-InBetween: Generating Object State Transitions in Ego-Centric Videos

Add code
Apr 20, 2026
Viaarxiv icon

Seek-and-Solve: Benchmarking MLLMs for Visual Clue-Driven Reasoning in Daily Scenarios

Add code
Apr 16, 2026
Viaarxiv icon

Selective Noise Suppression and Discriminative Mutual Interaction for Robust Audio-Visual Segmentation

Add code
Mar 15, 2026
Viaarxiv icon

RAGTrack: Language-aware RGBT Tracking with Retrieval-Augmented Generation

Add code
Mar 04, 2026
Viaarxiv icon

UETrack: A Unified and Efficient Framework for Single Object Tracking

Add code
Mar 03, 2026
Viaarxiv icon

Eva-Tracker: ESDF-update-free, Visibility-aware Planning with Target Reacquisition for Robust Aerial Tracking

Add code
Feb 13, 2026
Viaarxiv icon

Revisiting Salient Object Detection from an Observer-Centric Perspective

Add code
Feb 06, 2026
Viaarxiv icon

Interactive Spatial-Frequency Fusion Mamba for Multi-Modal Image Fusion

Add code
Feb 04, 2026
Viaarxiv icon

VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?

Add code
Feb 04, 2026
Viaarxiv icon

Think3D: Thinking with Space for Spatial Reasoning

Add code
Jan 19, 2026
Viaarxiv icon