Alert button
Picture for Harm van Seijen

Harm van Seijen

Alert button

Combining Spatial and Temporal Abstraction in Planning for Better Generalization

Add code
Bookmark button
Alert button
Sep 30, 2023
Mingde Zhao, Safa Alver, Harm van Seijen, Romain Laroche, Doina Precup, Yoshua Bengio

Viaarxiv icon

Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 15, 2023
Ali Rahimi-Kalahroudi, Janarthanan Rajendran, Ida Momennejad, Harm van Seijen, Sarath Chandar

Figure 1 for Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning
Figure 2 for Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning
Figure 3 for Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning
Figure 4 for Replay Buffer With Local Forgetting for Adaptive Deep Model-Based Reinforcement Learning
Viaarxiv icon

Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information

Add code
Bookmark button
Alert button
Oct 31, 2022
Riashat Islam, Manan Tomar, Alex Lamb, Yonathan Efroni, Hongyu Zang, Aniket Didolkar, Dipendra Misra, Xin Li, Harm van Seijen, Remi Tachet des Combes, John Langford

Figure 1 for Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information
Figure 2 for Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information
Figure 3 for Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information
Figure 4 for Agent-Controller Representations: Principled Offline RL with Rich Exogenous Information
Viaarxiv icon

Modular Lifelong Reinforcement Learning via Neural Composition

Add code
Bookmark button
Alert button
Jul 01, 2022
Jorge A. Mendez, Harm van Seijen, Eric Eaton

Figure 1 for Modular Lifelong Reinforcement Learning via Neural Composition
Figure 2 for Modular Lifelong Reinforcement Learning via Neural Composition
Figure 3 for Modular Lifelong Reinforcement Learning via Neural Composition
Figure 4 for Modular Lifelong Reinforcement Learning via Neural Composition
Viaarxiv icon

Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods

Add code
Bookmark button
Alert button
Apr 25, 2022
Yi Wan, Ali Rahimi-Kalahroudi, Janarthanan Rajendran, Ida Momennejad, Sarath Chandar, Harm van Seijen

Figure 1 for Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods
Figure 2 for Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods
Figure 3 for Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods
Figure 4 for Towards Evaluating Adaptivity of Model-Based Reinforcement Learning Methods
Viaarxiv icon

Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks

Add code
Bookmark button
Alert button
Jul 13, 2021
Sungryull Sohn, Sungtae Lee, Jongwook Choi, Harm van Seijen, Mehdi Fatemi, Honglak Lee

Figure 1 for Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Figure 2 for Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Figure 3 for Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Figure 4 for Shortest-Path Constrained Reinforcement Learning for Sparse Reward Tasks
Viaarxiv icon

A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms

Add code
Bookmark button
Alert button
Oct 02, 2020
Shangtong Zhang, Romain Laroche, Harm van Seijen, Shimon Whiteson, Remi Tachet des Combes

Figure 1 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Figure 2 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Figure 3 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Figure 4 for A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms
Viaarxiv icon

The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 07, 2020
Harm van Seijen, Hadi Nekoei, Evan Racah, Sarath Chandar

Figure 1 for The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
Figure 2 for The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
Figure 3 for The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
Figure 4 for The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning
Viaarxiv icon

Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 03, 2019
Harm van Seijen, Mehdi Fatemi, Arash Tavakoli

Figure 1 for Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning
Figure 2 for Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning
Figure 3 for Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning
Figure 4 for Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning
Viaarxiv icon

Learning Invariances for Policy Generalization

Add code
Bookmark button
Alert button
Sep 07, 2018
Remi Tachet des Combes, Philip Bachman, Harm van Seijen

Figure 1 for Learning Invariances for Policy Generalization
Viaarxiv icon