Picture for Chengming Xu

Chengming Xu

Evolution of Optimization Methods: Algorithms, Scenarios, and Evaluations

Add code
Apr 14, 2026
Viaarxiv icon

The Latent Space: Foundation, Evolution, Mechanism, Ability, and Outlook

Add code
Apr 02, 2026
Viaarxiv icon

Dual Latent Memory for Visual Multi-agent System

Add code
Jan 31, 2026
Viaarxiv icon

FFP-300K: Scaling First-Frame Propagation for Generalizable Video Editing

Add code
Jan 06, 2026
Viaarxiv icon

LongVie 2: Multimodal Controllable Ultra-Long Video World Model

Add code
Dec 15, 2025
Viaarxiv icon

VisMem: Latent Vision Memory Unlocks Potential of Vision-Language Models

Add code
Nov 14, 2025
Viaarxiv icon

SwiftVideo: A Unified Framework for Few-Step Video Generation through Trajectory-Distribution Alignment

Add code
Aug 08, 2025
Viaarxiv icon

Transferable Adversarial Attacks on Black-Box Vision-Language Models

Add code
May 02, 2025
Figure 1 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Figure 2 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Figure 3 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Figure 4 for Transferable Adversarial Attacks on Black-Box Vision-Language Models
Viaarxiv icon

Disentangle Identity, Cooperate Emotion: Correlation-Aware Emotional Talking Portrait Generation

Add code
Apr 25, 2025
Viaarxiv icon

CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors

Add code
Feb 20, 2025
Figure 1 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 2 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 3 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Figure 4 for CrossVTON: Mimicking the Logic Reasoning on Cross-category Virtual Try-on guided by Tri-zone Priors
Viaarxiv icon