Alert button
Picture for Yali Wang

Yali Wang

Alert button

SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Add code
Bookmark button
Alert button
Nov 06, 2023
Xinyuan Chen, Yaohui Wang, Lingjun Zhang, Shaobin Zhuang, Xin Ma, Jiashuo Yu, Yali Wang, Dahua Lin, Yu Qiao, Ziwei Liu

Viaarxiv icon

Harvest Video Foundation Models via Efficient Post-Pretraining

Add code
Bookmark button
Alert button
Oct 30, 2023
Yizhuo Li, Kunchang Li, Yinan He, Yi Wang, Yali Wang, Limin Wang, Yu Qiao, Ping Luo

Viaarxiv icon

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

Add code
Bookmark button
Alert button
Jul 13, 2023
Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao

Figure 1 for InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Figure 2 for InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Figure 3 for InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Figure 4 for InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation
Viaarxiv icon

VideoLLM: Modeling Video Sequence with Large Language Models

Add code
Bookmark button
Alert button
May 23, 2023
Guo Chen, Yin-Dong Zheng, Jiahao Wang, Jilan Xu, Yifei Huang, Junting Pan, Yi Wang, Yali Wang, Yu Qiao, Tong Lu, Limin Wang

Figure 1 for VideoLLM: Modeling Video Sequence with Large Language Models
Figure 2 for VideoLLM: Modeling Video Sequence with Large Language Models
Figure 3 for VideoLLM: Modeling Video Sequence with Large Language Models
Figure 4 for VideoLLM: Modeling Video Sequence with Large Language Models
Viaarxiv icon

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

Add code
Bookmark button
Alert button
May 11, 2023
Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao

Figure 1 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 2 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 3 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 4 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Viaarxiv icon

VideoChat: Chat-Centric Video Understanding

Add code
Bookmark button
Alert button
May 10, 2023
KunChang Li, Yinan He, Yi Wang, Yizhuo Li, Wenhai Wang, Ping Luo, Yali Wang, Limin Wang, Yu Qiao

Figure 1 for VideoChat: Chat-Centric Video Understanding
Figure 2 for VideoChat: Chat-Centric Video Understanding
Figure 3 for VideoChat: Chat-Centric Video Understanding
Figure 4 for VideoChat: Chat-Centric Video Understanding
Viaarxiv icon

VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking

Add code
Bookmark button
Alert button
Apr 18, 2023
Limin Wang, Bingkun Huang, Zhiyu Zhao, Zhan Tong, Yinan He, Yi Wang, Yali Wang, Yu Qiao

Figure 1 for VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Figure 2 for VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Figure 3 for VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Figure 4 for VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
Viaarxiv icon

Unmasked Teacher: Towards Training-Efficient Video Foundation Models

Add code
Bookmark button
Alert button
Mar 28, 2023
Kunchang Li, Yali Wang, Yizhuo Li, Yi Wang, Yinan He, Limin Wang, Yu Qiao

Figure 1 for Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Figure 2 for Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Figure 3 for Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Figure 4 for Unmasked Teacher: Towards Training-Efficient Video Foundation Models
Viaarxiv icon