Picture for Henghui Ding

Henghui Ding

Multimodal Referring Segmentation: A Survey

Add code
Aug 01, 2025
Viaarxiv icon

Towards Omnimodal Expressions and Reasoning in Referring Audio-Visual Segmentation

Add code
Jul 30, 2025
Viaarxiv icon

MOVE: Motion-Guided Few-Shot Video Object Segmentation

Add code
Jul 29, 2025
Viaarxiv icon

CharaConsist: Fine-Grained Consistent Character Generation

Add code
Jul 15, 2025
Figure 1 for CharaConsist: Fine-Grained Consistent Character Generation
Figure 2 for CharaConsist: Fine-Grained Consistent Character Generation
Figure 3 for CharaConsist: Fine-Grained Consistent Character Generation
Figure 4 for CharaConsist: Fine-Grained Consistent Character Generation
Viaarxiv icon

AnyI2V: Animating Any Conditional Image with Motion Control

Add code
Jul 03, 2025
Viaarxiv icon

Progressive Scaling Visual Object Tracking

Add code
May 26, 2025
Viaarxiv icon

SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models

Add code
May 24, 2025
Viaarxiv icon

Open-set Anomaly Segmentation in Complex Scenarios

Add code
Apr 28, 2025
Viaarxiv icon

PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild

Add code
Apr 15, 2025
Figure 1 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Figure 2 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Figure 3 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Figure 4 for PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild
Viaarxiv icon

Exploiting Temporal State Space Sharing for Video Semantic Segmentation

Add code
Mar 26, 2025
Viaarxiv icon