Haocheng Feng

InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance

Mar 24, 2026

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

Mar 19, 2026

MVHOI: Bridge Multi-view Condition to Complex Human-Object Interaction Video Reenactment via 3D Foundation Model

Mar 16, 2026

DISPLAY: Directable Human-Object Interaction Video Generation via Sparse Motion Guidance and Multi-Task Auxiliary

Mar 10, 2026

RnG: A Unified Transformer for Complete 3D Modeling from Partial Observations

Mar 01, 2026

CoLoGen: Progressive Learning of Concept-Localization Duality for Unified Image Generation

Feb 26, 2026

Query-Kontext: An Unified Multimodal Model for Image Generation and Editing

Sep 30, 2025

iDiT-HOI: Inpainting-based Hand Object Interaction Reenactment via Video Diffusion Transformer

Jun 15, 2025

AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers

Mar 25, 2025

Cosh-DiT: Co-Speech Gesture Video Synthesis via Hybrid Audio-Visual Diffusion Transformers

Mar 13, 2025