Picture for Yanhong Zeng

Yanhong Zeng

AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models

Add code
May 26, 2025
Viaarxiv icon

PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment

Add code
May 16, 2025
Viaarxiv icon

WORLDMEM: Long-term Consistent World Simulation with Memory

Add code
Apr 16, 2025
Viaarxiv icon

Multi-identity Human Image Animation with Structural Video Diffusion

Add code
Apr 05, 2025
Viaarxiv icon

LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning?

Add code
Mar 25, 2025
Viaarxiv icon

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Add code
Dec 10, 2024
Figure 1 for DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Figure 2 for DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Figure 3 for DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Figure 4 for DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
Viaarxiv icon

HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation

Add code
Jul 28, 2024
Figure 1 for HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
Figure 2 for HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
Figure 3 for HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
Figure 4 for HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation
Viaarxiv icon

Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models

Add code
Jul 11, 2024
Viaarxiv icon

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds

Add code
Jul 01, 2024
Figure 1 for FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
Figure 2 for FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
Figure 3 for FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
Figure 4 for FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds
Viaarxiv icon

StyleShot: A Snapshot on Any Style

Add code
Jul 01, 2024
Viaarxiv icon