Picture for Dongxu Yue

Dongxu Yue

UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation

Add code
Jul 03, 2025
Viaarxiv icon

A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing

Add code
Dec 10, 2023
Figure 1 for A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
Figure 2 for A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
Figure 3 for A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
Figure 4 for A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing
Viaarxiv icon

ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation

Add code
May 24, 2023
Viaarxiv icon