Picture for Mario Giulianelli

Mario Giulianelli

Shammie

A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents

Add code
Feb 09, 2026
Viaarxiv icon

Reasoning aligns language models to human cognition

Add code
Feb 09, 2026
Viaarxiv icon

Structure-Conditional Minimum Bayes Risk Decoding

Add code
Oct 23, 2025
Viaarxiv icon

Establishing Best Practices for Building Rigorous Agentic Benchmarks

Add code
Jul 03, 2025
Figure 1 for Establishing Best Practices for Building Rigorous Agentic Benchmarks
Figure 2 for Establishing Best Practices for Building Rigorous Agentic Benchmarks
Figure 3 for Establishing Best Practices for Building Rigorous Agentic Benchmarks
Figure 4 for Establishing Best Practices for Building Rigorous Agentic Benchmarks
Viaarxiv icon

Language Models over Canonical Byte-Pair Encodings

Add code
Jun 09, 2025
Viaarxiv icon

Information Locality as an Inductive Bias for Neural Language Models

Add code
Jun 05, 2025
Figure 1 for Information Locality as an Inductive Bias for Neural Language Models
Figure 2 for Information Locality as an Inductive Bias for Neural Language Models
Figure 3 for Information Locality as an Inductive Bias for Neural Language Models
Figure 4 for Information Locality as an Inductive Bias for Neural Language Models
Viaarxiv icon

The Harmonic Structure of Information Contours

Add code
Jun 04, 2025
Figure 1 for The Harmonic Structure of Information Contours
Figure 2 for The Harmonic Structure of Information Contours
Figure 3 for The Harmonic Structure of Information Contours
Figure 4 for The Harmonic Structure of Information Contours
Viaarxiv icon

Playpen: An Environment for Exploring Learning Through Conversational Interaction

Add code
Apr 11, 2025
Viaarxiv icon

Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests

Add code
Feb 20, 2025
Figure 1 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Figure 2 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Figure 3 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Figure 4 for Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests
Viaarxiv icon

From Language Models over Tokens to Language Models over Characters

Add code
Dec 04, 2024
Viaarxiv icon