Picture for Alexey Naumov

Alexey Naumov

Accelerating Nash Learning from Human Feedback via Mirror Prox

Add code
May 26, 2025
Viaarxiv icon

Statistical inference for Linear Stochastic Approximation with Markovian Noise

Add code
May 25, 2025
Viaarxiv icon

A note on concentration inequalities for the overlapped batch mean variance estimators for Markov chains

Add code
May 13, 2025
Viaarxiv icon

Improving GFlowNets with Monte Carlo Tree Search

Add code
Jun 19, 2024
Viaarxiv icon

Group and Shuffle: Efficient Structured Orthogonal Parametrization

Add code
Jun 14, 2024
Viaarxiv icon

Gaussian Approximation and Multiplier Bootstrap for Polyak-Ruppert Averaged Linear Stochastic Approximation with Applications to TD Learning

Add code
May 26, 2024
Viaarxiv icon

SCAFFLSA: Quantifying and Eliminating Heterogeneity Bias in Federated Linear Stochastic Approximation and Temporal Difference Learning

Add code
Feb 06, 2024
Viaarxiv icon

Model-free Posterior Sampling via Learning Rate Randomization

Add code
Oct 27, 2023
Viaarxiv icon

Demonstration-Regularized RL

Add code
Oct 26, 2023
Viaarxiv icon

Generative Flow Networks as Entropy-Regularized RL

Add code
Oct 23, 2023
Figure 1 for Generative Flow Networks as Entropy-Regularized RL
Figure 2 for Generative Flow Networks as Entropy-Regularized RL
Figure 3 for Generative Flow Networks as Entropy-Regularized RL
Figure 4 for Generative Flow Networks as Entropy-Regularized RL
Viaarxiv icon