Picture for Jiazhi Guan

Jiazhi Guan

ONE-SHOT: Compositional Human-Environment Video Synthesis via Spatial-Decoupled Motion Injection and Hybrid Context Integration

Add code
Apr 01, 2026
Viaarxiv icon

InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance

Add code
Mar 24, 2026
Viaarxiv icon

DISPLAY: Directable Human-Object Interaction Video Generation via Sparse Motion Guidance and Multi-Task Auxiliary

Add code
Mar 10, 2026
Viaarxiv icon

AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers

Add code
Mar 25, 2025
Figure 1 for AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers
Figure 2 for AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers
Figure 3 for AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers
Figure 4 for AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers
Viaarxiv icon

Cosh-DiT: Co-Speech Gesture Video Synthesis via Hybrid Audio-Visual Diffusion Transformers

Add code
Mar 13, 2025
Viaarxiv icon

TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model

Add code
Oct 14, 2024
Figure 1 for TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model
Figure 2 for TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model
Figure 3 for TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model
Figure 4 for TALK-Act: Enhance Textural-Awareness for 2D Speaking Avatar Reenactment with Diffusion Model
Viaarxiv icon

ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer

Add code
Aug 06, 2024
Figure 1 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Figure 2 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Figure 3 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Figure 4 for ReSyncer: Rewiring Style-based Generator for Unified Audio-Visually Synced Facial Performer
Viaarxiv icon

Building an Invisible Shield for Your Portrait against Deepfakes

Add code
May 22, 2023
Figure 1 for Building an Invisible Shield for Your Portrait against Deepfakes
Figure 2 for Building an Invisible Shield for Your Portrait against Deepfakes
Figure 3 for Building an Invisible Shield for Your Portrait against Deepfakes
Figure 4 for Building an Invisible Shield for Your Portrait against Deepfakes
Viaarxiv icon

StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator

Add code
May 09, 2023
Figure 1 for StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Figure 2 for StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Figure 3 for StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Figure 4 for StyleSync: High-Fidelity Generalized and Personalized Lip Sync in Style-based Generator
Viaarxiv icon

Detecting Deepfake by Creating Spatio-Temporal Regularity Disruption

Add code
Jul 21, 2022
Figure 1 for Detecting Deepfake by Creating Spatio-Temporal Regularity Disruption
Figure 2 for Detecting Deepfake by Creating Spatio-Temporal Regularity Disruption
Figure 3 for Detecting Deepfake by Creating Spatio-Temporal Regularity Disruption
Figure 4 for Detecting Deepfake by Creating Spatio-Temporal Regularity Disruption
Viaarxiv icon