Alert button
Picture for Mike Zheng Shou

Mike Zheng Shou

Alert button

Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models

Add code
Bookmark button
Alert button
Apr 03, 2024
Wentian Zhang, Haozhe Liu, Jinheng Xie, Francesco Faccio, Mike Zheng Shou, Jürgen Schmidhuber

Viaarxiv icon

Diffusion-Driven Self-Supervised Learning for Shape Reconstruction and Pose Estimation

Add code
Bookmark button
Alert button
Mar 19, 2024
Jingtao Sun, Yaonan Wang, Mingtao Feng, Chao Ding, Mike Zheng Shou, Ajmal Saeed Mian

Figure 1 for Diffusion-Driven Self-Supervised Learning for Shape Reconstruction and Pose Estimation
Figure 2 for Diffusion-Driven Self-Supervised Learning for Shape Reconstruction and Pose Estimation
Figure 3 for Diffusion-Driven Self-Supervised Learning for Shape Reconstruction and Pose Estimation
Figure 4 for Diffusion-Driven Self-Supervised Learning for Shape Reconstruction and Pose Estimation
Viaarxiv icon

DragAnything: Motion Control for Anything using Entity Representation

Add code
Bookmark button
Alert button
Mar 15, 2024
Weijia Wu, Zhuang Li, Yuchao Gu, Rui Zhao, Yefei He, David Junhao Zhang, Mike Zheng Shou, Yan Li, Tingting Gao, Di Zhang

Figure 1 for DragAnything: Motion Control for Anything using Entity Representation
Figure 2 for DragAnything: Motion Control for Anything using Entity Representation
Figure 3 for DragAnything: Motion Control for Anything using Entity Representation
Figure 4 for DragAnything: Motion Control for Anything using Entity Representation
Viaarxiv icon

Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters

Add code
Bookmark button
Alert button
Feb 21, 2024
Zechen Bai, Peng Chen, Xiaolan Peng, Lu Liu, Hui Chen, Mike Zheng Shou, Feng Tian

Viaarxiv icon

Skip \n: A Simple Method to Reduce Hallucination in Large Vision-Language Models

Add code
Bookmark button
Alert button
Feb 12, 2024
Zongbo Han, Zechen Bai, Haiyang Mei, Qianli Xu, Changqing Zhang, Mike Zheng Shou

Viaarxiv icon

Skip $\textbackslash n$: A simple method to reduce hallucination in Large Vision-Language Models

Add code
Bookmark button
Alert button
Feb 02, 2024
Zongbo Han, Zechen Bai, Haiyang Mei, Qianli Xu, Changqing Zhang, Mike Zheng Shou

Viaarxiv icon

Delocate: Detection and Localization for Deepfake Videos with Randomly-Located Tampered Traces

Add code
Bookmark button
Alert button
Jan 24, 2024
Juan Hu, Xin Liao, Difei Gao, Satoshi Tsutsui, Qian Wang, Zheng Qin, Mike Zheng Shou

Viaarxiv icon

Towards A Better Metric for Text-to-Video Generation

Add code
Bookmark button
Alert button
Jan 15, 2024
Jay Zhangjie Wu, Guian Fang, Haoning Wu, Xintao Wang, Yixiao Ge, Xiaodong Cun, David Junhao Zhang, Jia-Wei Liu, Yuchao Gu, Rui Zhao, Weisi Lin, Wynne Hsu, Ying Shan, Mike Zheng Shou

Viaarxiv icon

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions

Add code
Bookmark button
Alert button
Jan 03, 2024
David Junhao Zhang, Dongxu Li, Hung Le, Mike Zheng Shou, Caiming Xiong, Doyen Sahoo

Viaarxiv icon