Gianni Pellegrini

SokoBench: Evaluating Long-Horizon Planning and Reasoning in Large Language Models

Jan 28, 2026
Via arXiv