Picture for Donghao Luo

Donghao Luo

Soul: Breathe Life into Digital Human for High-fidelity Long-term Multimodal Animation

Add code
Dec 15, 2025
Viaarxiv icon

Transform Trained Transformer: Accelerating Naive 4K Video Generation Over 10$\times$

Add code
Dec 15, 2025
Viaarxiv icon

OracleAgent: A Multimodal Reasoning Agent for Oracle Bone Script Research

Add code
Oct 30, 2025
Viaarxiv icon

SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment

Add code
Aug 08, 2025
Viaarxiv icon

DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition

Add code
Mar 19, 2025
Figure 1 for DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
Figure 2 for DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
Figure 3 for DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
Figure 4 for DVHGNN: Multi-Scale Dilated Vision HGNN for Efficient Vision Recognition
Viaarxiv icon

CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors

Add code
Feb 20, 2025
Figure 1 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 2 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 3 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 4 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Viaarxiv icon

Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration

Add code
Dec 05, 2024
Figure 1 for Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration
Figure 2 for Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration
Figure 3 for Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration
Figure 4 for Exploring Real&Synthetic Dataset and Linear Attention in Image Restoration
Viaarxiv icon

DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation

Add code
Dec 04, 2024
Figure 1 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 2 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 3 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Figure 4 for DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation
Viaarxiv icon

Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing

Add code
Nov 26, 2024
Figure 1 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Figure 2 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Figure 3 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Figure 4 for Unveil Inversion and Invariance in Flow Transformer for Versatile Image Editing
Viaarxiv icon

Sonic: Shifting Focus to Global Audio Perception in Portrait Animation

Add code
Nov 25, 2024
Figure 1 for Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Figure 2 for Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Figure 3 for Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Figure 4 for Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
Viaarxiv icon