Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tyler Clark

Recurrent Off-Policy Deep Reinforcement Learning Doesn't Have to be Slow

Dec 23, 2025

Tyler Clark, Christine Evers, Jonathon Hare

Figure 1 for Recurrent Off-Policy Deep Reinforcement Learning Doesn't Have to be Slow

Figure 2 for Recurrent Off-Policy Deep Reinforcement Learning Doesn't Have to be Slow

Figure 3 for Recurrent Off-Policy Deep Reinforcement Learning Doesn't Have to be Slow

Figure 4 for Recurrent Off-Policy Deep Reinforcement Learning Doesn't Have to be Slow

Abstract:Recurrent off-policy deep reinforcement learning models achieve state-of-the-art performance but are often sidelined due to their high computational demands. In response, we introduce RISE (Recurrent Integration via Simplified Encodings), a novel approach that can leverage recurrent networks in any image-based off-policy RL setting without significant computational overheads via using both learnable and non-learnable encoder layers. When integrating RISE into leading non-recurrent off-policy RL algorithms, we observe a 35.6% human-normalized interquartile mean (IQM) performance improvement across the Atari benchmark. We analyze various implementation strategies to highlight the versatility and potential of our proposed framework.

Via

Access Paper or Ask Questions

Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC

Nov 06, 2024

Tyler Clark, Mark Towers, Christine Evers, Jonathon Hare

Figure 1 for Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC

Figure 2 for Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC

Figure 3 for Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC

Figure 4 for Beyond The Rainbow: High Performance Deep Reinforcement Learning On A Desktop PC

Abstract:Rainbow Deep Q-Network (DQN) demonstrated combining multiple independent enhancements could significantly boost a reinforcement learning (RL) agent's performance. In this paper, we present "Beyond The Rainbow" (BTR), a novel algorithm that integrates six improvements from across the RL literature to Rainbow DQN, establishing a new state-of-the-art for RL using a desktop PC, with a human-normalized interquartile mean (IQM) of 7.4 on atari-60. Beyond Atari, we demonstrate BTR's capability to handle complex 3D games, successfully training agents to play Super Mario Galaxy, Mario Kart, and Mortal Kombat with minimal algorithmic changes. Designing BTR with computational efficiency in mind, agents can be trained using a desktop PC on 200 million Atari frames within 12 hours. Additionally, we conduct detailed ablation studies of each component, analzying the performance and impact using numerous measures.

* 9 main pages, 26 total. Currently under review at ICLR

Via

Access Paper or Ask Questions