Sean Meyn

The Curse of Memory in Stochastic Approximation: Extended Version

Sep 17, 2023
Caio Kalil Lauand, Sean Meyn

Via arXiv

Convex Q Learning in a Stochastic Environment: Extended Version

Sep 10, 2023
Fan Lu, Sean Meyn

Via arXiv

Stability of Q-Learning Through Design and Optimism

Jul 05, 2023
Sean Meyn

Via arXiv

Sufficient Exploration for Convex Q-learning

Oct 17, 2022
Fan Lu, Prashant Mehta, Sean Meyn, Gergely Neu

Via arXiv

Model-Free Characterizations of the Hamilton-Jacobi-Bellman Equation and Convex Q-Learning in Continuous Time

Oct 14, 2022
Fan Lu, Joel Mathias, Sean Meyn, Karanjit Kalsi

Via arXiv

The ODE Method for Asymptotic Statistics in Stochastic Approximation and Reinforcement Learning

Oct 27, 2021
Vivek Borkar, Shuhang Chen, Adithya Devraj, Ioannis Kontoyiannis, Sean Meyn

Via arXiv

Accelerating Optimization and Reinforcement Learning with Quasi-Stochastic Approximation

Oct 01, 2020
Shuhang Chen, Adithya Devraj, Andrey Bernstein, Sean Meyn

Via arXiv

Explicit Mean-Square Error Bounds for Monte-Carlo and Linear Stochastic Approximation

Feb 07, 2020
Shuhang Chen, Adithya M. Devraj, Ana Bušić, Sean Meyn

Via arXiv

Zap Q-Learning With Nonlinear Function Approximation

Oct 11, 2019
Shuhang Chen, Adithya M. Devraj, Ana Bušić, Sean Meyn

Via arXiv