Guo Chen

InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks

Jan 15, 2024
Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai

Via arXiv

Retrieval-Augmented Egocentric Video Captioning

Jan 03, 2024
Jilan Xu, Yifei Huang, Junlin Hou, Guo Chen, Yuejie Zhang, Rui Feng, Weidi Xie

Via arXiv

Decoupling SQL Query Hardness Parsing for Text-to-SQL

Dec 29, 2023
Jiawen Yi, Guo Chen

Via arXiv

MVBench: A Comprehensive Multi-modal Video Understanding Benchmark

Dec 03, 2023
Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao

Via arXiv

Memory-and-Anticipation Transformer for Online Action Understanding

Aug 15, 2023
Jiahao Wang, Guo Chen, Yifei Huang, Limin Wang, Tong Lu

Via arXiv

AVSegFormer: Audio-Visual Segmentation with Transformer

Jul 05, 2023
Shengyi Gao, Zhe Chen, Guo Chen, Wenhai Wang, Tong Lu

Via arXiv