David Junhao Zhang

DragAnything: Motion Control for Anything using Entity Representation

Mar 15, 2024
Weijia Wu, Zhuang Li, Yuchao Gu, Rui Zhao, Yefei He, David Junhao Zhang, Mike Zheng Shou, Yan Li, Tingting Gao, Di Zhang

Towards A Better Metric for Text-to-Video Generation

Jan 15, 2024
Jay Zhangjie Wu, Guian Fang, Haoning Wu, Xintao Wang, Yixiao Ge, Xiaodong Cun, David Junhao Zhang, Jia-Wei Liu, Yuchao Gu, Rui Zhao, Weisi Lin, Wynne Hsu, Ying Shan, Mike Zheng Shou

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions

Jan 03, 2024
David Junhao Zhang, Dongxu Li, Hung Le, Mike Zheng Shou, Caiming Xiong, Doyen Sahoo

VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Dec 05, 2023
Yuchao Gu, Yipin Zhou, Bichen Wu, Licheng Yu, Jia-Wei Liu, Rui Zhao, Jay Zhangjie Wu, David Junhao Zhang, Mike Zheng Shou, Kevin Tang

MotionDirector: Motion Customization of Text-to-Video Diffusion Models

Oct 12, 2023
Rui Zhao, Yuchao Gu, Jay Zhangjie Wu, David Junhao Zhang, Jiawei Liu, Weijia Wu, Jussi Keppo, Mike Zheng Shou

Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation

Sep 27, 2023
David Junhao Zhang, Jay Zhangjie Wu, Jia-Wei Liu, Rui Zhao, Lingmin Ran, Yuchao Gu, Difei Gao, Mike Zheng Shou

Dataset Condensation via Generative Model

Sep 14, 2023
David Junhao Zhang, Heng Wang, Chuhui Xue, Rui Yan, Wenqing Zhang, Song Bai, Mike Zheng Shou

Free-ATM: Exploring Unsupervised Learning on Diffusion-Generated Images with Free Attention Masks

Aug 13, 2023
David Junhao Zhang, Mutian Xu, Chuhui Xue, Wenqing Zhang, Xiaoguang Han, Song Bai, Mike Zheng Shou

Too Large; Data Reduction for Vision-Language Pre-Training

Jun 01, 2023
Alex Jinpeng Wang, Kevin Qinghong Lin, David Junhao Zhang, Stan Weixian Lei, Mike Zheng Shou

Making Vision Transformers Efficient from A Token Sparsification View

Mar 30, 2023
Shuning Chang, Pichao Wang, Ming Lin, Fan Wang, David Junhao Zhang, Rong Jin, Mike Zheng Shou
