Ying Shan

Towards A Better Metric for Text-to-Video Generation

Jan 15, 2024
Jay Zhangjie Wu, Guian Fang, Haoning Wu, Xintao Wang, Yixiao Ge, Xiaodong Cun, David Junhao Zhang, Jia-Wei Liu, Yuchao Gu, Rui Zhao, Weisi Lin, Wynne Hsu, Ying Shan, Mike Zheng Shou

LLaMA Pro: Progressive LLaMA with Block Expansion

Jan 04, 2024
Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ping Luo, Ying Shan

VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation

Dec 14, 2023
Jinguo Zhu, Xiaohan Ding, Yixiao Ge, Yuying Ge, Sijie Zhao, Hengshuang Zhao, Xiaohua Wang, Ying Shan

SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models

Dec 11, 2023
Yuzhou Huang, Liangbin Xie, Xintao Wang, Ziyang Yuan, Xiaodong Cun, Yixiao Ge, Jiantao Zhou, Chao Dong, Rui Huang, Ruimao Zhang, Ying Shan

EgoPlan-Bench: Benchmarking Egocentric Embodied Planning with Multimodal Large Language Models

Dec 11, 2023
Yi Chen, Yuying Ge, Yixiao Ge, Mingyu Ding, Bohao Li, Rui Wang, Ruifeng Xu, Ying Shan, Xihui Liu

Neural Concatenative Singing Voice Conversion: Rethinking Concatenation-Based Approach for One-Shot Singing Voice Conversion

Dec 08, 2023
Binzhu Sha, Xu Li, Zhiyong Wu, Ying Shan, Helen Meng

PhotoMaker: Customizing Realistic Human Photos via Stacked ID Embedding

Dec 07, 2023
Zhen Li, Mingdeng Cao, Xintao Wang, Zhongang Qi, Ming-Ming Cheng, Ying Shan

MotionCtrl: A Unified and Flexible Motion Controller for Video Generation

Dec 06, 2023
Zhouxia Wang, Ziyang Yuan, Xintao Wang, Tianshui Chen, Menghan Xia, Ping Luo, Ying Shan

AnimateZero: Video Diffusion Models are Zero-Shot Image Animators

Dec 06, 2023
Jiwen Yu, Xiaodong Cun, Chenyang Qi, Yong Zhang, Xintao Wang, Ying Shan, Jian Zhang

MagicStick: Controllable Video Editing via Control Handle Transformations

Dec 05, 2023
Yue Ma, Xiaodong Cun, Yingqing He, Chenyang Qi, Xintao Wang, Ying Shan, Xiu Li, Qifeng Chen
