Picture for Alessandro Suglia

Alessandro Suglia

AgriPath: A Systematic Exploration of Architectural Trade-offs for Crop Disease Classification

Add code
Mar 17, 2026
Viaarxiv icon

Retrievit: In-context Retrieval Capabilities of Transformers, State Space Models, and Hybrid Architectures

Add code
Mar 03, 2026
Viaarxiv icon

Same Answer, Different Representations: Hidden instability in VLMs

Add code
Feb 06, 2026
Viaarxiv icon

Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding

Add code
Jun 06, 2025
Viaarxiv icon

Playpen: An Environment for Exploring Learning Through Conversational Interaction

Add code
Apr 11, 2025
Figure 1 for Playpen: An Environment for Exploring Learning Through Conversational Interaction
Figure 2 for Playpen: An Environment for Exploring Learning Through Conversational Interaction
Figure 3 for Playpen: An Environment for Exploring Learning Through Conversational Interaction
Figure 4 for Playpen: An Environment for Exploring Learning Through Conversational Interaction
Viaarxiv icon

Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests

Add code
Feb 20, 2025
Figure 1 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Figure 2 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Figure 3 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Figure 4 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Viaarxiv icon

CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts

Add code
Oct 20, 2024
Viaarxiv icon

Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models

Add code
Sep 21, 2024
Viaarxiv icon

Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling

Add code
Sep 09, 2024
Figure 1 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Figure 2 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Figure 3 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Figure 4 for Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling
Viaarxiv icon

Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks

Add code
Jul 04, 2024
Figure 1 for Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Figure 2 for Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Figure 3 for Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Figure 4 for Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks
Viaarxiv icon