Picture for Nigel Tao

Nigel Tao

The Optimal Reward Baseline for Gradient-Based Reinforcement Learning

Add code
Jan 10, 2013
Figure 1 for The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
Figure 2 for The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
Figure 3 for The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
Figure 4 for The Optimal Reward Baseline for Gradient-Based Reinforcement Learning
Viaarxiv icon