Mike Zheng Shou

Diffusion-Driven Self-Supervised Learning for Shape Reconstruction and Pose Estimation

Mar 19, 2024
Jingtao Sun, Yaonan Wang, Mingtao Feng, Chao Ding, Mike Zheng Shou, Ajmal Saeed Mian

DragAnything: Motion Control for Anything using Entity Representation

Mar 15, 2024
Weijia Wu, Zhuang Li, Yuchao Gu, Rui Zhao, Yefei He, David Junhao Zhang, Mike Zheng Shou, Yan Li, Tingting Gao, Di Zhang

Bring Your Own Character: A Holistic Solution for Automatic Facial Animation Generation of Customized Characters

Feb 21, 2024
Zechen Bai, Peng Chen, Xiaolan Peng, Lu Liu, Hui Chen, Mike Zheng Shou, Feng Tian

Skip \n: A Simple Method to Reduce Hallucination in Large Vision-Language Models

Feb 12, 2024
Zongbo Han, Zechen Bai, Haiyang Mei, Qianli Xu, Changqing Zhang, Mike Zheng Shou

Skip \n: A simple method to reduce hallucination in Large Vision-Language Models

Feb 02, 2024
Zongbo Han, Zechen Bai, Haiyang Mei, Qianli Xu, Changqing Zhang, Mike Zheng Shou

Delocate: Detection and Localization for Deepfake Videos with Randomly-Located Tampered Traces

Jan 24, 2024
Juan Hu, Xin Liao, Difei Gao, Satoshi Tsutsui, Qian Wang, Zheng Qin, Mike Zheng Shou

Towards A Better Metric for Text-to-Video Generation

Jan 15, 2024
Jay Zhangjie Wu, Guian Fang, Haoning Wu, Xintao Wang, Yixiao Ge, Xiaodong Cun, David Junhao Zhang, Jia-Wei Liu, Yuchao Gu, Rui Zhao, Weisi Lin, Wynne Hsu, Ying Shan, Mike Zheng Shou

Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions

Jan 03, 2024
David Junhao Zhang, Dongxu Li, Hung Le, Mike Zheng Shou, Caiming Xiong, Doyen Sahoo

COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Jan 01, 2024
Alex Jinpeng Wang, Linjie Li, Kevin Qinghong Lin, Jianfeng Wang, Kevin Lin, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou
