Picture for Sebastiano Monti

Sebastiano Monti

SokoBench: Evaluating Long-Horizon Planning and Reasoning in Large Language Models

Add code
Jan 28, 2026
Viaarxiv icon