Picture for Alex Ayoub

Alex Ayoub

Switching the Loss Reduces the Cost in Batch Reinforcement Learning

Add code
Mar 12, 2024
Figure 1 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Figure 2 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Figure 3 for Switching the Loss Reduces the Cost in Batch Reinforcement Learning
Viaarxiv icon

Exploration via linearly perturbed loss minimisation

Add code
Nov 13, 2023
Figure 1 for Exploration via linearly perturbed loss minimisation
Figure 2 for Exploration via linearly perturbed loss minimisation
Viaarxiv icon

Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off

Add code
Dec 17, 2022
Figure 1 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 2 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 3 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 4 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Viaarxiv icon

An Elementary Proof that Q-learning Converges Almost Surely

Add code
Aug 05, 2021
Viaarxiv icon

Randomized Exploration for Reinforcement Learning with General Value Function Approximation

Add code
Jun 15, 2021
Figure 1 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Figure 2 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Figure 3 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Figure 4 for Randomized Exploration for Reinforcement Learning with General Value Function Approximation
Viaarxiv icon

Model-Based Reinforcement Learning with Value-Targeted Regression

Add code
Jun 01, 2020
Figure 1 for Model-Based Reinforcement Learning with Value-Targeted Regression
Figure 2 for Model-Based Reinforcement Learning with Value-Targeted Regression
Figure 3 for Model-Based Reinforcement Learning with Value-Targeted Regression
Figure 4 for Model-Based Reinforcement Learning with Value-Targeted Regression
Viaarxiv icon