Alert button
Picture for Jianfeng Gao

Jianfeng Gao

Alert button

BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys

Oct 18, 2023
Yu Gu, Jianwei Yang, Naoto Usuyama, Chunyuan Li, Sheng Zhang, Matthew P. Lungren, Jianfeng Gao, Hoifung Poon

Figure 1 for BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys
Figure 2 for BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys
Figure 3 for BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys
Figure 4 for BiomedJourney: Counterfactual Biomedical Image Generation by Instruction-Learning from Multimodal Patient Journeys
Viaarxiv icon

Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V

Oct 17, 2023
Jianwei Yang, Hao Zhang, Feng Li, Xueyan Zou, Chunyuan Li, Jianfeng Gao

Figure 1 for Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Figure 2 for Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Figure 3 for Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Figure 4 for Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Viaarxiv icon

Fast-ELECTRA for Efficient Pre-training

Oct 11, 2023
Chengyu Dong, Liyuan Liu, Hao Cheng, Jingbo Shang, Jianfeng Gao, Xiaodong Liu

Figure 1 for Fast-ELECTRA for Efficient Pre-training
Figure 2 for Fast-ELECTRA for Efficient Pre-training
Figure 3 for Fast-ELECTRA for Efficient Pre-training
Figure 4 for Fast-ELECTRA for Efficient Pre-training
Viaarxiv icon

Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs

Oct 07, 2023
Suyu Ge, Yunan Zhang, Liyuan Liu, Minjia Zhang, Jiawei Han, Jianfeng Gao

Figure 1 for Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Figure 2 for Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Figure 3 for Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Figure 4 for Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
Viaarxiv icon

MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts

Oct 03, 2023
Pan Lu, Hritik Bansal, Tony Xia, Jiacheng Liu, Chunyuan Li, Hannaneh Hajishirzi, Hao Cheng, Kai-Wei Chang, Michel Galley, Jianfeng Gao

Viaarxiv icon

Sparse Backpropagation for MoE Training

Oct 01, 2023
Liyuan Liu, Jianfeng Gao, Weizhu Chen

Viaarxiv icon

MindAgent: Emergent Gaming Interaction

Sep 19, 2023
Ran Gong, Qiuyuan Huang, Xiaojian Ma, Hoi Vo, Zane Durante, Yusuke Noda, Zilong Zheng, Song-Chun Zhu, Demetri Terzopoulos, Li Fei-Fei, Jianfeng Gao

Figure 1 for MindAgent: Emergent Gaming Interaction
Figure 2 for MindAgent: Emergent Gaming Interaction
Figure 3 for MindAgent: Emergent Gaming Interaction
Figure 4 for MindAgent: Emergent Gaming Interaction
Viaarxiv icon

Multimodal Foundation Models: From Specialists to General-Purpose Assistants

Sep 18, 2023
Chunyuan Li, Zhe Gan, Zhengyuan Yang, Jianwei Yang, Linjie Li, Lijuan Wang, Jianfeng Gao

Figure 1 for Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Figure 2 for Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Figure 3 for Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Figure 4 for Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Viaarxiv icon

An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models

Sep 18, 2023
Yadong Lu, Chunyuan Li, Haotian Liu, Jianwei Yang, Jianfeng Gao, Yelong Shen

Figure 1 for An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Figure 2 for An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Figure 3 for An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Figure 4 for An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Viaarxiv icon