Picture for Simeon Junker

Simeon Junker

Are Multimodal Large Language Models Pragmatically Competent Listeners in Simple Reference Resolution Tasks?

Add code
Jun 13, 2025
Viaarxiv icon

SceneGram: Conceptualizing and Describing Tangrams in Scene Context

Add code
Jun 13, 2025
Viaarxiv icon

The Illusion of Competence: Evaluating the Effect of Explanations on Users' Mental Models of Visual Question Answering Systems

Add code
Jun 27, 2024
Figure 1 for The Illusion of Competence: Evaluating the Effect of Explanations on Users' Mental Models of Visual Question Answering Systems
Figure 2 for The Illusion of Competence: Evaluating the Effect of Explanations on Users' Mental Models of Visual Question Answering Systems
Figure 3 for The Illusion of Competence: Evaluating the Effect of Explanations on Users' Mental Models of Visual Question Answering Systems
Figure 4 for The Illusion of Competence: Evaluating the Effect of Explanations on Users' Mental Models of Visual Question Answering Systems
Viaarxiv icon

Resilience through Scene Context in Visual Referring Expression Generation

Add code
Apr 18, 2024
Figure 1 for Resilience through Scene Context in Visual Referring Expression Generation
Figure 2 for Resilience through Scene Context in Visual Referring Expression Generation
Figure 3 for Resilience through Scene Context in Visual Referring Expression Generation
Figure 4 for Resilience through Scene Context in Visual Referring Expression Generation
Viaarxiv icon