Chunyuan Li

Semantic-SAM: Segment and Recognize Anything at Any Granularity

Jul 10, 2023
Feng Li, Hao Zhang, Peize Sun, Xueyan Zou, Shilong Liu, Jianwei Yang, Chunyuan Li, Lei Zhang, Jianfeng Gao

Large Multimodal Models: Notes on CVPR 2023 Tutorial

Jun 26, 2023
Chunyuan Li

MIMIC-IT: Multi-Modal In-Context Instruction Tuning

Jun 08, 2023
Bo Li, Yuanhan Zhang, Liangyu Chen, Jinghao Wang, Fanyi Pu, Jingkang Yang, Chunyuan Li, Ziwei Liu

LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day

Jun 01, 2023
Chunyuan Li, Cliff Wong, Sheng Zhang, Naoto Usuyama, Haotian Liu, Jianwei Yang, Tristan Naumann, Hoifung Poon, Jianfeng Gao

On the Hidden Mystery of OCR in Large Multimodal Models

May 13, 2023
Yuliang Liu, Zhang Li, Hongliang Li, Wenwen Yu, Mingxin Huang, Dezhi Peng, Mingyu Liu, Mingrui Chen, Chunyuan Li, Lianwen Jin, Xiang Bai

Towards Building the Federated GPT: Federated Instruction Tuning

May 09, 2023
Jianyi Zhang, Saeed Vahidian, Martin Kuo, Chunyuan Li, Ruiyi Zhang, Guoyin Wang, Yiran Chen

Visual Instruction Tuning

Apr 17, 2023
Haotian Liu, Chunyuan Li, Qingyang Wu, Yong Jae Lee

Instruction Tuning with GPT-4

Apr 06, 2023
Baolin Peng, Chunyuan Li, Pengcheng He, Michel Galley, Jianfeng Gao

A Simple Framework for Open-Vocabulary Segmentation and Detection

Mar 20, 2023
Hao Zhang, Feng Li, Xueyan Zou, Shilong Liu, Chunyuan Li, Jianfeng Gao, Jianwei Yang, Lei Zhang

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

Mar 20, 2023
Shilong Liu, Zhaoyang Zeng, Tianhe Ren, Feng Li, Hao Zhang, Jie Yang, Chunyuan Li, Jianwei Yang, Hang Su, Jun Zhu, Lei Zhang
