Shiwei Zhang

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

Dec 25, 2023
Xiang Wang, Shiwei Zhang, Hangjie Yuan, Zhiwu Qing, Biao Gong, Yingya Zhang, Yujun Shen, Changxin Gao, Nong Sang

Via arXiv

InstructVideo: Instructing Video Diffusion Models with Human Feedback

Dec 19, 2023
Hangjie Yuan, Shiwei Zhang, Xiang Wang, Yujie Wei, Tao Feng, Yining Pan, Yingya Zhang, Ziwei Liu, Samuel Albanie, Dong Ni

Via arXiv

DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Dec 15, 2023
Yifeng Ma, Shiwei Zhang, Jiayu Wang, Xiang Wang, Yingya Zhang, Zhidong Deng

Via arXiv

VideoLCM: Video Latent Consistency Model

Dec 14, 2023
Xiang Wang, Shiwei Zhang, Han Zhang, Yu Liu, Yingya Zhang, Changxin Gao, Nong Sang

Via arXiv

Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation

Dec 07, 2023
Zhiwu Qing, Shiwei Zhang, Jiayu Wang, Xiang Wang, Yujie Wei, Yingya Zhang, Changxin Gao, Nong Sang

Via arXiv

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion

Dec 07, 2023
Yujie Wei, Shiwei Zhang, Zhiwu Qing, Hangjie Yuan, Zhiheng Liu, Yu Liu, Yingya Zhang, Jingren Zhou, Hongming Shan

Via arXiv

Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation

Nov 30, 2023
Biao Gong, Siteng Huang, Yutong Feng, Shiwei Zhang, Yuyuan Li, Yu Liu

Via arXiv

I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models

Nov 07, 2023
Shiwei Zhang, Jiayu Wang, Yingya Zhang, Kang Zhao, Hangjie Yuan, Zhiwu Qin, Xiang Wang, Deli Zhao, Jingren Zhou

Via arXiv

Utilising a Large Language Model to Annotate Subject Metadata: A Case Study in an Australian National Research Data Catalogue

Oct 17, 2023
Shiwei Zhang, Mingfang Wu, Xiuzhen Zhang

Via arXiv

Few-shot Action Recognition with Captioning Foundation Models

Oct 16, 2023
Xiang Wang, Shiwei Zhang, Hangjie Yuan, Yingya Zhang, Changxin Gao, Deli Zhao, Nong Sang

Via arXiv