Alert button
Picture for Chunyuan Li

Chunyuan Li

Alert button

A Simple Framework for Open-Vocabulary Segmentation and Detection

Add code
Bookmark button
Alert button
Mar 15, 2023
Hao Zhang, Feng Li, Xueyan Zou, Shilong Liu, Chunyuan Li, Jianfeng Gao, Jianwei Yang, Lei Zhang

Figure 1 for A Simple Framework for Open-Vocabulary Segmentation and Detection
Figure 2 for A Simple Framework for Open-Vocabulary Segmentation and Detection
Figure 3 for A Simple Framework for Open-Vocabulary Segmentation and Detection
Figure 4 for A Simple Framework for Open-Vocabulary Segmentation and Detection
Viaarxiv icon

Scaling Vision-Language Models with Sparse Mixture of Experts

Add code
Bookmark button
Alert button
Mar 13, 2023
Sheng Shen, Zhewei Yao, Chunyuan Li, Trevor Darrell, Kurt Keutzer, Yuxiong He

Figure 1 for Scaling Vision-Language Models with Sparse Mixture of Experts
Figure 2 for Scaling Vision-Language Models with Sparse Mixture of Experts
Figure 3 for Scaling Vision-Language Models with Sparse Mixture of Experts
Figure 4 for Scaling Vision-Language Models with Sparse Mixture of Experts
Viaarxiv icon

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

Add code
Bookmark button
Alert button
Mar 10, 2023
Shilong Liu, Zhaoyang Zeng, Tianhe Ren, Feng Li, Hao Zhang, Jie Yang, Chunyuan Li, Jianwei Yang, Hang Su, Jun Zhu, Lei Zhang

Figure 1 for Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Figure 2 for Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Figure 3 for Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Figure 4 for Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
Viaarxiv icon

Learning Customized Visual Models with Retrieval-Augmented Knowledge

Add code
Bookmark button
Alert button
Jan 17, 2023
Haotian Liu, Kilho Son, Jianwei Yang, Ce Liu, Jianfeng Gao, Yong Jae Lee, Chunyuan Li

Figure 1 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Figure 2 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Figure 3 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Figure 4 for Learning Customized Visual Models with Retrieval-Augmented Knowledge
Viaarxiv icon

GLIGEN: Open-Set Grounded Text-to-Image Generation

Add code
Bookmark button
Alert button
Jan 17, 2023
Yuheng Li, Haotian Liu, Qingyang Wu, Fangzhou Mu, Jianwei Yang, Jianfeng Gao, Chunyuan Li, Yong Jae Lee

Figure 1 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Figure 2 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Figure 3 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Figure 4 for GLIGEN: Open-Set Grounded Text-to-Image Generation
Viaarxiv icon

Generalized Decoding for Pixel, Image, and Language

Add code
Bookmark button
Alert button
Dec 21, 2022
Xueyan Zou, Zi-Yi Dou, Jianwei Yang, Zhe Gan, Linjie Li, Chunyuan Li, Xiyang Dai, Harkirat Behl, Jianfeng Wang, Lu Yuan, Nanyun Peng, Lijuan Wang, Yong Jae Lee, Jianfeng Gao

Figure 1 for Generalized Decoding for Pixel, Image, and Language
Figure 2 for Generalized Decoding for Pixel, Image, and Language
Figure 3 for Generalized Decoding for Pixel, Image, and Language
Figure 4 for Generalized Decoding for Pixel, Image, and Language
Viaarxiv icon

Hierarchical Transformer for Survival Prediction Using Multimodality Whole Slide Images and Genomics

Add code
Bookmark button
Alert button
Nov 29, 2022
Chunyuan Li, Xinliang Zhu, Jiawen Yao, Junzhou Huang

Figure 1 for Hierarchical Transformer for Survival Prediction Using Multimodality Whole Slide Images and Genomics
Figure 2 for Hierarchical Transformer for Survival Prediction Using Multimodality Whole Slide Images and Genomics
Figure 3 for Hierarchical Transformer for Survival Prediction Using Multimodality Whole Slide Images and Genomics
Figure 4 for Hierarchical Transformer for Survival Prediction Using Multimodality Whole Slide Images and Genomics
Viaarxiv icon

Lafite2: Few-shot Text-to-Image Generation

Add code
Bookmark button
Alert button
Oct 25, 2022
Yufan Zhou, Chunyuan Li, Changyou Chen, Jianfeng Gao, Jinhui Xu

Figure 1 for Lafite2: Few-shot Text-to-Image Generation
Figure 2 for Lafite2: Few-shot Text-to-Image Generation
Figure 3 for Lafite2: Few-shot Text-to-Image Generation
Figure 4 for Lafite2: Few-shot Text-to-Image Generation
Viaarxiv icon

Vision-Language Pre-training: Basics, Recent Advances, and Future Trends

Add code
Bookmark button
Alert button
Oct 17, 2022
Zhe Gan, Linjie Li, Chunyuan Li, Lijuan Wang, Zicheng Liu, Jianfeng Gao

Figure 1 for Vision-Language Pre-training: Basics, Recent Advances, and Future Trends
Figure 2 for Vision-Language Pre-training: Basics, Recent Advances, and Future Trends
Figure 3 for Vision-Language Pre-training: Basics, Recent Advances, and Future Trends
Figure 4 for Vision-Language Pre-training: Basics, Recent Advances, and Future Trends
Viaarxiv icon

STT: Soft Template Tuning for Few-Shot Adaptation

Add code
Bookmark button
Alert button
Jul 18, 2022
Ping Yu, Wei Wang, Chunyuan Li, Ruiyi Zhang, Zhanpeng Jin, Changyou Chen

Figure 1 for STT: Soft Template Tuning for Few-Shot Adaptation
Figure 2 for STT: Soft Template Tuning for Few-Shot Adaptation
Figure 3 for STT: Soft Template Tuning for Few-Shot Adaptation
Figure 4 for STT: Soft Template Tuning for Few-Shot Adaptation
Viaarxiv icon