David Junhao Zhang

DragAnything: Motion Control for Anything using Entity Representation

Mar 15, 2024
Via arXiv

Towards A Better Metric for Text-to-Video Generation

Jan 15, 2024
Via arXiv

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions

Jan 03, 2024
Via arXiv

VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Dec 05, 2023
Via arXiv

MotionDirector: Motion Customization of Text-to-Video Diffusion Models

Oct 12, 2023
Via arXiv

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Sep 27, 2023
Via arXiv

Dataset Condensation via Generative Model

Sep 14, 2023
Via arXiv

Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks

Aug 13, 2023
Via arXiv

Too Large; Data Reduction for Vision-Language Pre-Training

Jun 01, 2023
Via arXiv

Making Vision Transformers Efficient from A Token Sparsification View

Mar 30, 2023
Via arXiv