Mike Zheng Shou

COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Jan 01, 2024
Alex Jinpeng Wang, Linjie Li, Kevin Qinghong Lin, Jianfeng Wang, Kevin Lin, Zhengyuan Yang, Lijuan Wang, Mike Zheng Shou

Via arXiv

ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation

Jan 01, 2024
Difei Gao, Lei Ji, Zechen Bai, Mingyu Ouyang, Peiran Li, Dongxing Mao, Qinchen Wu, Weichen Zhang, Peiyi Wang, Xiangwu Guo, Hengxu Wang, Luowei Zhou, Mike Zheng Shou

Via arXiv

Parrot Captions Teach CLIP to Spot Text

Dec 28, 2023
Yiqi Lin, Conghui He, Alex Jinpeng Wang, Bin Wang, Weijia Li, Mike Zheng Shou

Via arXiv

MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance

Dec 21, 2023
Qi Mao, Lan Chen, Yuchao Gu, Zhen Fang, Mike Zheng Shou

Via arXiv

ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors

Dec 20, 2023
Weijia Mao, Yan-Pei Cao, Jia-Wei Liu, Zhongcong Xu, Mike Zheng Shou

Via arXiv

Genixer: Empowering Multimodal Large Language Models as a Powerful Data Generator

Dec 11, 2023
Henry Hengyuan Zhao, Pan Zhou, Mike Zheng Shou

Via arXiv

X-Adapter: Adding Universal Compatibility of Plugins for Upgraded Diffusion Model

Dec 06, 2023
Lingmin Ran, Xiaodong Cun, Jia-Wei Liu, Rui Zhao, Song Zijie, Xintao Wang, Jussi Keppo, Mike Zheng Shou

Via arXiv

VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence

Dec 05, 2023
Yuchao Gu, Yipin Zhou, Bichen Wu, Licheng Yu, Jia-Wei Liu, Rui Zhao, Jay Zhangjie Wu, David Junhao Zhang, Mike Zheng Shou, Kevin Tang

Via arXiv