Picture for Samuele Marro

Samuele Marro

Profit is the Red Team: Stress-Testing Agents in Strategic Economic Interactions

Add code
Mar 21, 2026
Viaarxiv icon

Benchmarking at the Edge of Comprehension

Add code
Feb 15, 2026
Viaarxiv icon

An End-to-end Planning Framework with Agentic LLMs and PDDL

Add code
Dec 10, 2025
Viaarxiv icon

Large Language Models Miss the Multi-Agent Mark

Add code
May 27, 2025
Figure 1 for Large Language Models Miss the Multi-Agent Mark
Viaarxiv icon

Language Models Are Implicitly Continuous

Add code
Apr 04, 2025
Viaarxiv icon

Code Simulation as a Proxy for High-order Tasks in Large Language Models

Add code
Feb 05, 2025
Figure 1 for Code Simulation as a Proxy for High-order Tasks in Large Language Models
Figure 2 for Code Simulation as a Proxy for High-order Tasks in Large Language Models
Figure 3 for Code Simulation as a Proxy for High-order Tasks in Large Language Models
Figure 4 for Code Simulation as a Proxy for High-order Tasks in Large Language Models
Viaarxiv icon

Jailbreaking Large Language Models in Infinitely Many Ways

Add code
Jan 18, 2025
Figure 1 for Jailbreaking Large Language Models in Infinitely Many Ways
Figure 2 for Jailbreaking Large Language Models in Infinitely Many Ways
Figure 3 for Jailbreaking Large Language Models in Infinitely Many Ways
Figure 4 for Jailbreaking Large Language Models in Infinitely Many Ways
Viaarxiv icon

Authenticated Delegation and Authorized AI Agents

Add code
Jan 16, 2025
Figure 1 for Authenticated Delegation and Authorized AI Agents
Figure 2 for Authenticated Delegation and Authorized AI Agents
Figure 3 for Authenticated Delegation and Authorized AI Agents
Figure 4 for Authenticated Delegation and Authorized AI Agents
Viaarxiv icon

A Scalable Communication Protocol for Networks of Large Language Models

Add code
Oct 14, 2024
Figure 1 for A Scalable Communication Protocol for Networks of Large Language Models
Figure 2 for A Scalable Communication Protocol for Networks of Large Language Models
Figure 3 for A Scalable Communication Protocol for Networks of Large Language Models
Figure 4 for A Scalable Communication Protocol for Networks of Large Language Models
Viaarxiv icon

A Notion of Complexity for Theory of Mind via Discrete World Models

Add code
Jun 16, 2024
Viaarxiv icon