Picture for Henghui Ding

Henghui Ding

Progressive Scaling Visual Object Tracking

Add code
May 26, 2025
Viaarxiv icon

SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models

Add code
May 24, 2025
Viaarxiv icon

Open-set Anomaly Segmentation in Complex Scenarios

Add code
Apr 28, 2025
Viaarxiv icon

PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild

Add code
Apr 15, 2025
Viaarxiv icon

Exploiting Temporal State Space Sharing for Video Semantic Segmentation

Add code
Mar 26, 2025
Viaarxiv icon

QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge

Add code
Mar 20, 2025
Viaarxiv icon

Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions

Add code
Jan 03, 2025
Viaarxiv icon

Hierarchical Alignment-enhanced Adaptive Grounding Network for Generalized Referring Expression Comprehension

Add code
Jan 02, 2025
Viaarxiv icon

LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers

Add code
Dec 17, 2024
Viaarxiv icon

Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models

Add code
Nov 04, 2024
Figure 1 for Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models
Figure 2 for Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models
Figure 3 for Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models
Figure 4 for Continual LLaVA: Continual Instruction Tuning in Large Vision-Language Models
Viaarxiv icon