Picture for Herke van Hoof

Herke van Hoof

Making Universal Policies Universal

Add code
Feb 20, 2025
Figure 1 for Making Universal Policies Universal
Figure 2 for Making Universal Policies Universal
Figure 3 for Making Universal Policies Universal
Figure 4 for Making Universal Policies Universal
Viaarxiv icon

Bridge the Inference Gaps of Neural Processes via Expectation Maximization

Add code
Jan 04, 2025
Figure 1 for Bridge the Inference Gaps of Neural Processes via Expectation Maximization
Figure 2 for Bridge the Inference Gaps of Neural Processes via Expectation Maximization
Figure 3 for Bridge the Inference Gaps of Neural Processes via Expectation Maximization
Figure 4 for Bridge the Inference Gaps of Neural Processes via Expectation Maximization
Viaarxiv icon

Mitigating Exposure Bias in Online Learning to Rank Recommendation: A Novel Reward Model for Cascading Bandits

Add code
Aug 08, 2024
Figure 1 for Mitigating Exposure Bias in Online Learning to Rank Recommendation: A Novel Reward Model for Cascading Bandits
Figure 2 for Mitigating Exposure Bias in Online Learning to Rank Recommendation: A Novel Reward Model for Cascading Bandits
Figure 3 for Mitigating Exposure Bias in Online Learning to Rank Recommendation: A Novel Reward Model for Cascading Bandits
Figure 4 for Mitigating Exposure Bias in Online Learning to Rank Recommendation: A Novel Reward Model for Cascading Bandits
Viaarxiv icon

Going Beyond Popularity and Positivity Bias: Correcting for Multifactorial Bias in Recommender Systems

Add code
Apr 29, 2024
Figure 1 for Going Beyond Popularity and Positivity Bias: Correcting for Multifactorial Bias in Recommender Systems
Figure 2 for Going Beyond Popularity and Positivity Bias: Correcting for Multifactorial Bias in Recommender Systems
Figure 3 for Going Beyond Popularity and Positivity Bias: Correcting for Multifactorial Bias in Recommender Systems
Figure 4 for Going Beyond Popularity and Positivity Bias: Correcting for Multifactorial Bias in Recommender Systems
Viaarxiv icon

Planning with a Learned Policy Basis to Optimally Solve Complex Tasks

Add code
Mar 22, 2024
Figure 1 for Planning with a Learned Policy Basis to Optimally Solve Complex Tasks
Figure 2 for Planning with a Learned Policy Basis to Optimally Solve Complex Tasks
Figure 3 for Planning with a Learned Policy Basis to Optimally Solve Complex Tasks
Figure 4 for Planning with a Learned Policy Basis to Optimally Solve Complex Tasks
Viaarxiv icon

Hierarchical Reinforcement Learning for Power Network Topology Control

Add code
Nov 03, 2023
Figure 1 for Hierarchical Reinforcement Learning for Power Network Topology Control
Figure 2 for Hierarchical Reinforcement Learning for Power Network Topology Control
Figure 3 for Hierarchical Reinforcement Learning for Power Network Topology Control
Figure 4 for Hierarchical Reinforcement Learning for Power Network Topology Control
Viaarxiv icon

Learning Objective-Specific Active Learning Strategies with Attentive Neural Processes

Add code
Sep 11, 2023
Figure 1 for Learning Objective-Specific Active Learning Strategies with Attentive Neural Processes
Figure 2 for Learning Objective-Specific Active Learning Strategies with Attentive Neural Processes
Figure 3 for Learning Objective-Specific Active Learning Strategies with Attentive Neural Processes
Figure 4 for Learning Objective-Specific Active Learning Strategies with Attentive Neural Processes
Viaarxiv icon

Uncoupled Learning of Differential Stackelberg Equilibria with Commitments

Add code
Feb 07, 2023
Viaarxiv icon

Reusable Options through Gradient-based Meta Learning

Add code
Dec 22, 2022
Figure 1 for Reusable Options through Gradient-based Meta Learning
Figure 2 for Reusable Options through Gradient-based Meta Learning
Figure 3 for Reusable Options through Gradient-based Meta Learning
Figure 4 for Reusable Options through Gradient-based Meta Learning
Viaarxiv icon

Exposure-Aware Recommendation using Contextual Bandits

Add code
Sep 04, 2022
Figure 1 for Exposure-Aware Recommendation using Contextual Bandits
Figure 2 for Exposure-Aware Recommendation using Contextual Bandits
Figure 3 for Exposure-Aware Recommendation using Contextual Bandits
Figure 4 for Exposure-Aware Recommendation using Contextual Bandits
Viaarxiv icon