Jiashuo Yu

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Mar 22, 2024
Yi Wang, Kunchang Li, Xinhao Li, Jiashuo Yu, Yinan He, Guo Chen, Baoqi Pei, Rongkun Zheng, Jilan Xu, Zun Wang, Yansong Shi, Tianxiang Jiang, Songze Li, Hongjie Zhang, Yifei Huang, Yu Qiao, Yali Wang, Limin Wang

VBench: Comprehensive Benchmark Suite for Video Generative Models

Nov 29, 2023
Ziqi Huang, Yinan He, Jiashuo Yu, Fan Zhang, Chenyang Si, Yuming Jiang, Yuanhan Zhang, Tianxing Wu, Qingyang Jin, Nattapol Chanpaisit, Yaohui Wang, Xinyuan Chen, Limin Wang, Dahua Lin, Yu Qiao, Ziwei Liu

SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Nov 06, 2023
Xinyuan Chen, Yaohui Wang, Lingjun Zhang, Shaobin Zhuang, Xin Ma, Jiashuo Yu, Yali Wang, Dahua Lin, Yu Qiao, Ziwei Liu

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models

Sep 27, 2023
Yaohui Wang, Xinyuan Chen, Xin Ma, Shangchen Zhou, Ziqi Huang, Yi Wang, Ceyuan Yang, Yinan He, Jiashuo Yu, Peiqing Yang, Yuwei Guo, Tianxing Wu, Chenyang Si, Yuming Jiang, Cunjian Chen, Chen Change Loy, Bo Dai, Dahua Lin, Yu Qiao, Ziwei Liu

InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation

Jul 13, 2023
Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

May 11, 2023
Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao

Long-Term Rhythmic Video Soundtracker

May 02, 2023
Jiashuo Yu, Yaohui Wang, Xinyuan Chen, Xiao Sun, Yu Qiao

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Dec 07, 2022
Yi Wang, Kunchang Li, Yizhuo Li, Yinan He, Bingkun Huang, Zhiyu Zhao, Hongjie Zhang, Jilan Xu, Yi Liu, Zun Wang, Sen Xing, Guo Chen, Junting Pan, Jiashuo Yu, Yali Wang, Limin Wang, Yu Qiao

InternVideo-Ego4D: A Pack of Champion Solutions to Ego4D Challenges

Nov 17, 2022
Guo Chen, Sen Xing, Zhe Chen, Yi Wang, Kunchang Li, Yizhuo Li, Yi Liu, Jiahao Wang, Yin-Dong Zheng, Bingkun Huang, Zhiyu Zhao, Junting Pan, Yifei Huang, Zun Wang, Jiashuo Yu, Yinan He, Hongjie Zhang, Tong Lu, Yali Wang, Limin Wang, Yu Qiao
