Yingya Zhang

UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation

Jun 03, 2024
Via arXiv

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

Dec 25, 2023
Via arXiv

InstructVideo: Instructing Video Diffusion Models with Human Feedback

Dec 19, 2023
Via arXiv

AE-NeRF: Audio Enhanced Neural Radiance Field for Few Shot Talking Head Synthesis

Dec 18, 2023
Via arXiv

DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Dec 15, 2023
Via arXiv

VideoLCM: Video Latent Consistency Model

Dec 14, 2023
Via arXiv

Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation

Dec 07, 2023
Via arXiv

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion

Dec 07, 2023
Via arXiv

I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Nov 07, 2023
Via arXiv

Few-shot Action Recognition with Captioning Foundation Models

Oct 16, 2023
Via arXiv