Alert button
Picture for Kunyu Shi

Kunyu Shi

Alert button

Enhancing Vision-Language Pre-training with Rich Supervisions

Add code
Bookmark button
Alert button
Mar 05, 2024
Yuan Gao, Kunyu Shi, Pengkai Zhu, Edouard Belval, Oren Nuriel, Srikar Appalaraju, Shabnam Ghadar, Vijay Mahadevan, Zhuowen Tu, Stefano Soatto

Figure 1 for Enhancing Vision-Language Pre-training with Rich Supervisions
Figure 2 for Enhancing Vision-Language Pre-training with Rich Supervisions
Figure 3 for Enhancing Vision-Language Pre-training with Rich Supervisions
Figure 4 for Enhancing Vision-Language Pre-training with Rich Supervisions
Viaarxiv icon

Non-autoregressive Sequence-to-Sequence Vision-Language Models

Add code
Bookmark button
Alert button
Mar 04, 2024
Kunyu Shi, Qi Dong, Luis Goncalves, Zhuowen Tu, Stefano Soatto

Figure 1 for Non-autoregressive Sequence-to-Sequence Vision-Language Models
Figure 2 for Non-autoregressive Sequence-to-Sequence Vision-Language Models
Figure 3 for Non-autoregressive Sequence-to-Sequence Vision-Language Models
Figure 4 for Non-autoregressive Sequence-to-Sequence Vision-Language Models
Viaarxiv icon

Musketeer (All for One, and One for All): A Generalist Vision-Language Model with Task Explanation Prompts

Add code
Bookmark button
Alert button
May 11, 2023
Zhaoyang Zhang, Yantao Shen, Kunyu Shi, Zhaowei Cai, Jun Fang, Siqi Deng, Hao Yang, Davide Modolo, Zhuowen Tu, Stefano Soatto

Figure 1 for Musketeer (All for One, and One for All): A Generalist Vision-Language Model with Task Explanation Prompts
Figure 2 for Musketeer (All for One, and One for All): A Generalist Vision-Language Model with Task Explanation Prompts
Figure 3 for Musketeer (All for One, and One for All): A Generalist Vision-Language Model with Task Explanation Prompts
Figure 4 for Musketeer (All for One, and One for All): A Generalist Vision-Language Model with Task Explanation Prompts
Viaarxiv icon