Henghui Ding

OcclusionFormer: Arranging Z-Order for Layout-Grounded Image Generation

May 20, 2026
Via arXiv

ROSE: Retrieval-Oriented Segmentation Enhancement

Apr 15, 2026
Via arXiv

PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow

Mar 26, 2026
Via arXiv

EffectErase: Joint Video Object Removal and Insertion for High-Quality Effect Erasing

Mar 19, 2026
Via arXiv

GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering

Mar 16, 2026
Via arXiv

AutoFly: Vision-Language-Action Model for UAV Autonomous Navigation in the Wild

Feb 10, 2026
Via arXiv

FMBench: Adaptive Large Language Model Output Formatting

Feb 06, 2026
Via arXiv

Audit After Segmentation: Reference-Free Mask Quality Assessment for Language-Referred Audio-Visual Segmentation

Feb 03, 2026
Via arXiv

SAM3-DMS: Decoupled Memory Selection for Multi-target Video Segmentation of SAM3

Jan 14, 2026
Via arXiv

MeViS: A Multi-Modal Dataset for Referring Motion Expression Video Segmentation

Dec 11, 2025
Via arXiv