Alert button
Picture for Weiyun Wang

Weiyun Wang

Alert button

How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites

Add code
Bookmark button
Alert button
Apr 29, 2024
Zhe Chen, Weiyun Wang, Hao Tian, Shenglong Ye, Zhangwei Gao, Erfei Cui, Wenwen Tong, Kongzhi Hu, Jiapeng Luo, Zheng Ma, Ji Ma, Jiaqi Wang, Xiaoyi Dong, Hang Yan, Hewei Guo, Conghui He, Botian Shi, Zhenjiang Jin, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Min Dou, Lewei Lu, Xizhou Zhu, Tong Lu, Dahua Lin, Yu Qiao, Jifeng Dai, Wenhai Wang

Figure 1 for How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Figure 2 for How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Figure 3 for How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Figure 4 for How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites
Viaarxiv icon

Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures

Add code
Bookmark button
Alert button
Mar 07, 2024
Yuchen Duan, Weiyun Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Hongsheng Li, Jifeng Dai, Wenhai Wang

Figure 1 for Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Figure 2 for Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Figure 3 for Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Figure 4 for Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures
Viaarxiv icon

The All-Seeing Project V2: Towards General Relation Comprehension of the Open World

Add code
Bookmark button
Alert button
Feb 29, 2024
Weiyun Wang, Yiming Ren, Haowen Luo, Tiantong Li, Chenxiang Yan, Zhe Chen, Wenhai Wang, Qingyun Li, Lewei Lu, Xizhou Zhu, Yu Qiao, Jifeng Dai

Viaarxiv icon

MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer

Add code
Bookmark button
Alert button
Jan 18, 2024
Changyao Tian, Xizhou Zhu, Yuwen Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Yuntao Chen, Lewei Lu, Tong Lu, Jie Zhou, Hongsheng Li, Yu Qiao, Jifeng Dai

Viaarxiv icon

The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World

Add code
Bookmark button
Alert button
Aug 03, 2023
Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao

Figure 1 for The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World
Figure 2 for The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World
Figure 3 for The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World
Figure 4 for The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World
Viaarxiv icon

InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language

Add code
Bookmark button
Alert button
May 11, 2023
Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao

Figure 1 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 2 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 3 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Figure 4 for InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language
Viaarxiv icon

Demystify Transformers & Convolutions in Modern Image Deep Networks

Add code
Bookmark button
Alert button
Nov 10, 2022
Jifeng Dai, Min Shi, Weiyun Wang, Sitong Wu, Linjie Xing, Wenhai Wang, Xizhou Zhu, Lewei Lu, Jie Zhou, Xiaogang Wang, Yu Qiao, Xiaowei Hu

Figure 1 for Demystify Transformers & Convolutions in Modern Image Deep Networks
Figure 2 for Demystify Transformers & Convolutions in Modern Image Deep Networks
Figure 3 for Demystify Transformers & Convolutions in Modern Image Deep Networks
Figure 4 for Demystify Transformers & Convolutions in Modern Image Deep Networks
Viaarxiv icon