Alert button
Picture for Miroslav Štrupl

Miroslav Štrupl

Alert button

Upside-Down Reinforcement Learning Can Diverge in Stochastic Environments With Episodic Resets

Add code
Bookmark button
Alert button
May 13, 2022
Miroslav Štrupl, Francesco Faccio, Dylan R. Ashley, Jürgen Schmidhuber, Rupesh Kumar Srivastava

Figure 1 for Upside-Down Reinforcement Learning Can Diverge in Stochastic Environments With Episodic Resets
Viaarxiv icon

Reward-Weighted Regression Converges to a Global Optimum

Add code
Bookmark button
Alert button
Jul 19, 2021
Miroslav Štrupl, Francesco Faccio, Dylan R. Ashley, Rupesh Kumar Srivastava, Jürgen Schmidhuber

Figure 1 for Reward-Weighted Regression Converges to a Global Optimum
Figure 2 for Reward-Weighted Regression Converges to a Global Optimum
Viaarxiv icon