Picture for Miroslav Štrupl

Miroslav Štrupl

Deep Hedging Under Non-Convexity: Limitations and a Case for AlphaZero

Add code
Oct 02, 2025
Figure 1 for Deep Hedging Under Non-Convexity: Limitations and a Case for AlphaZero
Figure 2 for Deep Hedging Under Non-Convexity: Limitations and a Case for AlphaZero
Figure 3 for Deep Hedging Under Non-Convexity: Limitations and a Case for AlphaZero
Figure 4 for Deep Hedging Under Non-Convexity: Limitations and a Case for AlphaZero
Viaarxiv icon

On the Convergence and Stability of Upside-Down Reinforcement Learning, Goal-Conditioned Supervised Learning, and Online Decision Transformers

Add code
Feb 08, 2025
Viaarxiv icon

Upside-Down Reinforcement Learning Can Diverge in Stochastic Environments With Episodic Resets

Add code
May 13, 2022
Figure 1 for Upside-Down Reinforcement Learning Can Diverge in Stochastic Environments With Episodic Resets
Viaarxiv icon

Reward-Weighted Regression Converges to a Global Optimum

Add code
Jul 19, 2021
Figure 1 for Reward-Weighted Regression Converges to a Global Optimum
Figure 2 for Reward-Weighted Regression Converges to a Global Optimum
Viaarxiv icon