Picture for Abhanshu Sharma

Abhanshu Sharma

Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs

Add code
Mar 19, 2024
Figure 1 for Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs
Figure 2 for Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs
Figure 3 for Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs
Figure 4 for Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs
Viaarxiv icon

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Add code
Mar 08, 2024
Viaarxiv icon

ScreenAI: A Vision-Language Model for UI and Infographics Understanding

Add code
Feb 19, 2024
Figure 1 for ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Figure 2 for ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Figure 3 for ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Figure 4 for ScreenAI: A Vision-Language Model for UI and Infographics Understanding
Viaarxiv icon

Gemini: A Family of Highly Capable Multimodal Models

Add code
Dec 19, 2023
Viaarxiv icon

Towards Better Semantic Understanding of Mobile Interfaces

Add code
Oct 06, 2022
Figure 1 for Towards Better Semantic Understanding of Mobile Interfaces
Figure 2 for Towards Better Semantic Understanding of Mobile Interfaces
Figure 3 for Towards Better Semantic Understanding of Mobile Interfaces
Figure 4 for Towards Better Semantic Understanding of Mobile Interfaces
Viaarxiv icon