Picture for Denis Belomestny

Denis Belomestny

Tight Bounds for Schrödinger Potential Estimation in Unpaired Image-to-Image Translation Problems

Add code
Aug 10, 2025
Viaarxiv icon

Accelerating Nash Learning from Human Feedback via Mirror Prox

Add code
May 26, 2025
Viaarxiv icon

Weighted mesh algorithms for general Markov decision processes: Convergence and tractability

Add code
Jun 29, 2024
Figure 1 for Weighted mesh algorithms for general Markov decision processes: Convergence and tractability
Figure 2 for Weighted mesh algorithms for general Markov decision processes: Convergence and tractability
Figure 3 for Weighted mesh algorithms for general Markov decision processes: Convergence and tractability
Figure 4 for Weighted mesh algorithms for general Markov decision processes: Convergence and tractability
Viaarxiv icon

Model-free Posterior Sampling via Learning Rate Randomization

Add code
Oct 27, 2023
Viaarxiv icon

Demonstration-Regularized RL

Add code
Oct 26, 2023
Figure 1 for Demonstration-Regularized RL
Viaarxiv icon

Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms

Add code
Apr 06, 2023
Viaarxiv icon

Theoretical guarantees for neural control variates in MCMC

Add code
Apr 03, 2023
Viaarxiv icon

Fast Rates for Maximum Entropy Exploration

Add code
Mar 14, 2023
Figure 1 for Fast Rates for Maximum Entropy Exploration
Figure 2 for Fast Rates for Maximum Entropy Exploration
Figure 3 for Fast Rates for Maximum Entropy Exploration
Figure 4 for Fast Rates for Maximum Entropy Exploration
Viaarxiv icon

Primal-dual regression approach for Markov decision processes with general state and action space

Add code
Oct 04, 2022
Viaarxiv icon

Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees

Add code
Sep 28, 2022
Figure 1 for Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
Figure 2 for Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
Figure 3 for Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
Figure 4 for Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
Viaarxiv icon