Picture for Yizhang Jin

Yizhang Jin

LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description

Add code
Aug 09, 2024
Figure 1 for LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description
Figure 2 for LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description
Figure 3 for LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description
Figure 4 for LLaVA-VSD: Large Language-and-Vision Assistant for Visual Spatial Description
Viaarxiv icon

Efficient Multimodal Large Language Models: A Survey

Add code
May 17, 2024
Figure 1 for Efficient Multimodal Large Language Models: A Survey
Figure 2 for Efficient Multimodal Large Language Models: A Survey
Figure 3 for Efficient Multimodal Large Language Models: A Survey
Figure 4 for Efficient Multimodal Large Language Models: A Survey
Viaarxiv icon

Generalized Category Discovery in Semantic Segmentation

Add code
Nov 20, 2023
Figure 1 for Generalized Category Discovery in Semantic Segmentation
Figure 2 for Generalized Category Discovery in Semantic Segmentation
Figure 3 for Generalized Category Discovery in Semantic Segmentation
Figure 4 for Generalized Category Discovery in Semantic Segmentation
Viaarxiv icon