Picture for Rebecca Ansell

Rebecca Ansell

How Clued up are LLMs? Evaluating Multi-Step Deductive Reasoning in a Text-Based Game Environment

Add code
Mar 17, 2026
Viaarxiv icon