Herke van Hoof

Planning with a Learned Policy Basis to Optimally Solve Complex Tasks

Mar 22, 2024
Guillermo Infante, David Kuric, Anders Jonsson, Vicenç Gómez, Herke van Hoof

Via arXiv

Hierarchical Reinforcement Learning for Power Network Topology Control

Nov 03, 2023
Blazej Manczak, Jan Viebahn, Herke van Hoof

Via arXiv

Learning Objective-Specific Active Learning Strategies with Attentive Neural Processes

Sep 11, 2023
Tim Bakker, Herke van Hoof, Max Welling

Via arXiv

Uncoupled Learning of Differential Stackelberg Equilibria with Commitments

Feb 07, 2023
Robert Loftin, Mustafa Mert Çelikok, Herke van Hoof, Samuel Kaski, Frans A. Oliehoek

Via arXiv

Reusable Options through Gradient-based Meta Learning

Dec 22, 2022
David Kuric, Herke van Hoof

Via arXiv

Exposure-Aware Recommendation using Contextual Bandits

Sep 04, 2022
Masoud Mansoury, Bamshad Mobasher, Herke van Hoof

Via arXiv

Calculus on MDPs: Potential Shaping as a Gradient

Aug 20, 2022
Erik Jenner, Herke van Hoof, Adam Gleave

Via arXiv

Logic-based AI for Interpretable Board Game Winner Prediction with Tsetlin Machine

Mar 08, 2022
Charul Giri, Ole-Christoffer Granmo, Herke van Hoof, Christian D. Blakely

Via arXiv

Reliably Re-Acting to Partner's Actions with the Social Intrinsic Motivation of Transfer Empowerment

Mar 07, 2022
Tessa van der Heiden, Herke van Hoof, Efstratios Gavves, Christoph Salge

Via arXiv

Fast and Data Efficient Reinforcement Learning from Pixels via Non-Parametric Value Approximation

Mar 07, 2022
Alexander Long, Alan Blair, Herke van Hoof

Via arXiv