Alert button
Picture for Denis Belomestny

Denis Belomestny

Alert button

Model-free Posterior Sampling via Learning Rate Randomization

Add code
Bookmark button
Alert button
Oct 27, 2023
Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Remi Munos, Alexey Naumov, Pierre Perrault, Michal Valko, Pierre Menard

Figure 1 for Model-free Posterior Sampling via Learning Rate Randomization
Figure 2 for Model-free Posterior Sampling via Learning Rate Randomization
Figure 3 for Model-free Posterior Sampling via Learning Rate Randomization
Figure 4 for Model-free Posterior Sampling via Learning Rate Randomization
Viaarxiv icon

Demonstration-Regularized RL

Add code
Bookmark button
Alert button
Oct 26, 2023
Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Alexey Naumov, Pierre Perrault, Michal Valko, Pierre Menard

Viaarxiv icon

Sharp Deviations Bounds for Dirichlet Weighted Sums with Application to analysis of Bayesian algorithms

Add code
Bookmark button
Alert button
Apr 06, 2023
Denis Belomestny, Pierre Menard, Alexey Naumov, Daniil Tiapkin, Michal Valko

Viaarxiv icon

Theoretical guarantees for neural control variates in MCMC

Add code
Bookmark button
Alert button
Apr 03, 2023
Denis Belomestny, Artur Goldman, Alexey Naumov, Sergey Samsonov

Viaarxiv icon

Fast Rates for Maximum Entropy Exploration

Add code
Bookmark button
Alert button
Mar 14, 2023
Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Remi Munos, Alexey Naumov, Pierre Perrault, Yunhao Tang, Michal Valko, Pierre Menard

Figure 1 for Fast Rates for Maximum Entropy Exploration
Figure 2 for Fast Rates for Maximum Entropy Exploration
Figure 3 for Fast Rates for Maximum Entropy Exploration
Figure 4 for Fast Rates for Maximum Entropy Exploration
Viaarxiv icon

Primal-dual regression approach for Markov decision processes with general state and action space

Add code
Bookmark button
Alert button
Oct 04, 2022
Denis Belomestny, John Schoenmakers

Viaarxiv icon

Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees

Add code
Bookmark button
Alert button
Sep 28, 2022
Daniil Tiapkin, Denis Belomestny, Daniele Calandriello, Eric Moulines, Remi Munos, Alexey Naumov, Mark Rowland, Michal Valko, Pierre Menard

Figure 1 for Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
Figure 2 for Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
Figure 3 for Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
Figure 4 for Optimistic Posterior Sampling for Reinforcement Learning with Few Samples and Tight Guarantees
Viaarxiv icon

Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization

Add code
Bookmark button
Alert button
Jun 15, 2022
Maxim Kaledin, Alexander Golubev, Denis Belomestny

Figure 1 for Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization
Figure 2 for Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization
Figure 3 for Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization
Figure 4 for Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization
Viaarxiv icon

From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses

Add code
Bookmark button
Alert button
May 16, 2022
Daniil Tiapkin, Denis Belomestny, Eric Moulines, Alexey Naumov, Sergey Samsonov, Yunhao Tang, Michal Valko, Pierre Menard

Figure 1 for From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
Figure 2 for From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
Figure 3 for From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
Figure 4 for From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses
Viaarxiv icon

Ex$^2$MCMC: Sampling through Exploration Exploitation

Add code
Bookmark button
Alert button
Nov 04, 2021
Evgeny Lagutin, Daniil Selikhanovych, Achille Thin, Sergey Samsonov, Alexey Naumov, Denis Belomestny, Maxim Panov, Eric Moulines

Figure 1 for Ex$^2$MCMC: Sampling through Exploration Exploitation
Figure 2 for Ex$^2$MCMC: Sampling through Exploration Exploitation
Figure 3 for Ex$^2$MCMC: Sampling through Exploration Exploitation
Figure 4 for Ex$^2$MCMC: Sampling through Exploration Exploitation
Viaarxiv icon