Picture for Alessandro Suglia

Alessandro Suglia

Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding

Add code
Jun 06, 2025
Viaarxiv icon

Playpen: An Environment for Exploring Learning Through Conversational Interaction

Add code
Apr 11, 2025
Viaarxiv icon

Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests

Add code
Feb 20, 2025
Figure 1 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Figure 2 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Figure 3 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Figure 4 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Viaarxiv icon

CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts

Add code
Oct 20, 2024
Viaarxiv icon

Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models

Add code
Sep 21, 2024
Viaarxiv icon

Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling

Add code
Sep 09, 2024
Figure 1 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Figure 2 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Figure 3 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Figure 4 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Viaarxiv icon

Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks

Add code
Jul 04, 2024
Figure 1 for Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Figure 2 for Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Figure 3 for Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Figure 4 for Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Viaarxiv icon

Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation

Add code
Jun 27, 2024
Figure 1 for Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation
Figure 2 for Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation
Figure 3 for Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation
Figure 4 for Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation
Viaarxiv icon

LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

Add code
Jun 26, 2024
Figure 1 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 2 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 3 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Figure 4 for LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks
Viaarxiv icon

AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding

Add code
Jun 19, 2024
Viaarxiv icon