Picture for Chengming Xu

Chengming Xu

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Add code
Dec 15, 2025
Viaarxiv icon

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Add code
Nov 14, 2025
Viaarxiv icon

SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment

Add code
Aug 08, 2025
Viaarxiv icon

Transferable Adversarial Attacks on Black-Box Vision-Language Models

Add code
May 02, 2025
Figure 1 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Figure 2 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Figure 3 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Figure 4 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Viaarxiv icon

Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation

Add code
Apr 25, 2025
Viaarxiv icon

CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors

Add code
Feb 20, 2025
Figure 1 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 2 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 3 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 4 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Viaarxiv icon

SVFR: A Unified Framework for Generalized Video Face Restoration

Add code
Jan 03, 2025
Figure 1 for SVFR: A Unified Framework for Generalized Video Face Restoration
Figure 2 for SVFR: A Unified Framework for Generalized Video Face Restoration
Figure 3 for SVFR: A Unified Framework for Generalized Video Face Restoration
Figure 4 for SVFR: A Unified Framework for Generalized Video Face Restoration
Viaarxiv icon

FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on

Add code
Nov 22, 2024
Figure 1 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 2 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 3 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Figure 4 for FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on
Viaarxiv icon

Robust Network Learning via Inverse Scale Variational Sparsification

Add code
Sep 27, 2024
Figure 1 for Robust Network Learning via Inverse Scale Variational Sparsification
Figure 2 for Robust Network Learning via Inverse Scale Variational Sparsification
Figure 3 for Robust Network Learning via Inverse Scale Variational Sparsification
Figure 4 for Robust Network Learning via Inverse Scale Variational Sparsification
Viaarxiv icon

SVP: Style-Enhanced Vivid Portrait Talking Head Diffusion Model

Add code
Sep 05, 2024
Viaarxiv icon