Emanuele La Malfa

Benchmarking at the Edge of Comprehension

Feb 15, 2026
Via arXiv

Tacit Coordination of Large Language Models

Jan 28, 2026
Via arXiv

An End-to-end Planning Framework with Agentic LLMs and PDDL

Dec 10, 2025
Via arXiv

Large Language Models Miss the Multi-Agent Mark

May 27, 2025
Via arXiv

Fixed Point Explainability

May 18, 2025
Via arXiv

Language Models Are Implicitly Continuous

Apr 04, 2025
Via arXiv

Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning

Mar 13, 2025
Via arXiv

When Claims Evolve: Evaluating and Enhancing the Robustness of Embedding Models Against Misinformation Edits

Mar 05, 2025
Via arXiv

Code Simulation as a Proxy for High-order Tasks in Large Language Models

Feb 05, 2025
Via arXiv

Jailbreaking Large Language Models in Infinitely Many Ways

Jan 18, 2025
Via arXiv