Bjarne Gregori

Deep Reinforcement Learning Agents are not even close to Human Intelligence

May 27, 2025

Quentin Delfosse, Jannis Blüml, Fabian Tatai, Théo Vincent, Bjarne Gregori, Elisabeth Dillies, Jan Peters, Constantin Rothkopf, Kristian Kersting

Abstract: Deep reinforcement learning (RL) agents achieve impressive results in a wide variety of tasks, but they lack zero-shot adaptation capabilities. While most robustness evaluations focus on task complexifications, on which humans also struggle to maintain performance, no evaluation has been performed on task simplifications. To tackle this issue, we introduce HackAtari, a set of task variations of the Arcade Learning Environments. We use it to demonstrate that, contrary to humans, RL agents systematically exhibit huge performance drops on simpler versions of their training tasks, uncovering agents' consistent reliance on shortcuts. Our analysis across multiple algorithms and architectures highlights the persistent gap between RL agents and human behavioral intelligence, underscoring the need for new benchmarks and methodologies that enforce systematic generalization testing beyond static evaluation protocols. Training and testing in the same environment is not enough to obtain agents equipped with human-like intelligence.

* 49 pages in total, 5 main figures, 14 figures total

HackAtari: Atari Learning Environments for Robust and Continual Reinforcement Learning

Jun 06, 2024

Quentin Delfosse, Jannis Blüml, Bjarne Gregori, Kristian Kersting

Abstract: Artificial agents' adaptability to novelty and alignment with intended behavior are crucial for their effective deployment. Reinforcement learning (RL) leverages novelty as a means of exploration, yet agents often struggle to handle novel situations, hindering generalization. To address these issues, we propose HackAtari, a framework introducing controlled novelty to the most common RL benchmark, the Atari Learning Environment. HackAtari allows us to create novel game scenarios (including simplifications for curriculum learning), to swap the game elements' colors, and to introduce different reward signals for the agent. We demonstrate that current agents trained on the original environments exhibit robustness failures, and evaluate HackAtari's efficacy in enhancing RL agents' robustness and aligning behavior through experiments using C51 and PPO. Overall, HackAtari can be used to improve the robustness of current and future RL algorithms, allowing Neuro-Symbolic RL, curriculum RL, causal RL, as well as LLM-driven RL. Our work underscores the significance of developing interpretability in RL agents.

* 9 main pages, 4 pages references, 19 pages of appendix

OCAtari: Object-Centric Atari 2600 Reinforcement Learning Environments

Jun 14, 2023

Quentin Delfosse, Jannis Blüml, Bjarne Gregori, Sebastian Sztwiertnia, Kristian Kersting

Abstract: Cognitive science and psychology suggest that object-centric representations of complex scenes are a promising step towards enabling efficient abstract reasoning from low-level perceptual features. Yet, most deep reinforcement learning approaches rely only on pixel-based representations that do not capture the compositional properties of natural scenes. For this, we need environments and datasets that allow us to work on and evaluate object-centric approaches. We present OCAtari, a set of environments that provide object-centric state representations of Atari games, the most-used evaluation framework for deep RL approaches. OCAtari also allows for RAM state manipulations of the games to change and create specific or even novel situations. The code base for this work is available at github.com/k4ntz/OC_Atari.

* 26 pages, 9 main paper pages, 14 appendix pages. In main paper: 5 figures, 2 tables
