Picture for Qingdong He

Qingdong He

The devil is in the details: Enhancing Video Virtual Try-On via Keyframe-Driven Details Injection

Add code
Dec 23, 2025
Viaarxiv icon

CareCom: Generative Image Composition with Calibrated Reference Features

Add code
Nov 14, 2025
Viaarxiv icon

Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning

Add code
Jul 02, 2025
Viaarxiv icon

KAN or MLP? Point Cloud Shows the Way Forward

Add code
Apr 18, 2025
Viaarxiv icon

UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer

Add code
Mar 12, 2025
Figure 1 for UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Figure 2 for UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Figure 3 for UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Figure 4 for UniCombine: Unified Multi-Conditional Combination with Diffusion Transformer
Viaarxiv icon

PixelPonder: Dynamic Patch Adaptation for Enhanced Multi-Conditional Text-to-Image Generation

Add code
Mar 09, 2025
Viaarxiv icon

DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation

Add code
Dec 04, 2024
Figure 1 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 2 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 3 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 4 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Viaarxiv icon

Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing

Add code
Nov 26, 2024
Figure 1 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Figure 2 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Figure 3 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Figure 4 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Viaarxiv icon

Sonic: Shifting Focus to Global Audio Perception in Portrait Animation

Add code
Nov 25, 2024
Figure 1 for Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Figure 2 for Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Figure 3 for Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Figure 4 for Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Viaarxiv icon

FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on

Add code
Nov 22, 2024
Figure 1 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 2 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 3 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 4 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Viaarxiv icon