Picture for Biniyam Aschalew Tolera

Biniyam Aschalew Tolera

By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting

Add code
Jul 15, 2024
Figure 1 for By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting
Figure 2 for By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting
Figure 3 for By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting
Figure 4 for By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting
Viaarxiv icon