Zhihong Chen

ALLaVA: Harnessing GPT4V-synthesized Data for A Lite Vision-Language Model

Feb 18, 2024

CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation

Jan 22, 2024

MLLM-Bench, Evaluating Multi-modal LLMs using GPT-4V

Nov 23, 2023

Exploiting Low-confidence Pseudo-labels for Source-free Object Detection

Oct 19, 2023

AceGPT, Localizing Large Language Models in Arabic

Sep 22, 2023

CMB: A Comprehensive Medical Benchmark in Chinese

Aug 17, 2023

Bridging Vision and Language Encoders: Parameter-Efficient Tuning for Referring Image Segmentation

Jul 21, 2023

Advancing Visual Grounding with Scene Knowledge: Benchmark and Method

Jul 21, 2023

On the Difference of BERT-style and CLIP-style Text Encoders

Jun 06, 2023

HuatuoGPT, towards Taming Language Model to Be a Doctor

May 24, 2023