Sandro Pezzelle

University of Amsterdam

They want to pretend not to understand: The Limits of Current LLMs in Interpreting Implicit Content of Political Discourse

Jun 07, 2025

Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding

Jun 06, 2025

Are formal and functional linguistic mechanisms dissociated?

Mar 14, 2025

From Tools to Teammates: Evaluating LLMs in Multi-Session Coding Interactions

Feb 19, 2025

Natural Language Generation from Visual Sequences: Challenges and Future Directions

Feb 18, 2025

Modelling Multimodal Integration in Human Concept Processing with Vision-and-Language Models

Jul 25, 2024

Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition

Jul 05, 2024

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

Jun 26, 2024

Have Faith in Faithfulness: Going Beyond Circuit Overlap When Finding Model Mechanisms

Mar 26, 2024

Naming, Describing, and Quantifying Visual Objects in Humans and LLMs

Mar 13, 2024