Alert button
Picture for Sean P. Meyn

Sean P. Meyn

Alert button

Convex Q-Learning, Part 1: Deterministic Optimal Control

Add code
Bookmark button
Alert button
Aug 08, 2020
Prashant G. Mehta, Sean P. Meyn

Viaarxiv icon

Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning

Add code
Bookmark button
Alert button
Feb 24, 2020
Adithya M. Devraj, Sean P. Meyn

Figure 1 for Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning
Figure 2 for Q-learning with Uniformly Bounded Variance: Large Discounting is Not a Barrier to Fast Learning
Viaarxiv icon

Zap Q-Learning for Optimal Stopping Time Problems

Add code
Bookmark button
Alert button
May 01, 2019
Shuhang Chen, Adithya M. Devraj, Ana Bušić, Sean P. Meyn

Figure 1 for Zap Q-Learning for Optimal Stopping Time Problems
Viaarxiv icon

Differential Temporal Difference Learning

Add code
Bookmark button
Alert button
Dec 28, 2018
Adithya M. Devraj, Ioannis Kontoyiannis, Sean P. Meyn

Figure 1 for Differential Temporal Difference Learning
Figure 2 for Differential Temporal Difference Learning
Figure 3 for Differential Temporal Difference Learning
Figure 4 for Differential Temporal Difference Learning
Viaarxiv icon

Fastest Convergence for Q-learning

Add code
Bookmark button
Alert button
Mar 21, 2018
Adithya M. Devraj, Sean P. Meyn

Figure 1 for Fastest Convergence for Q-learning
Figure 2 for Fastest Convergence for Q-learning
Figure 3 for Fastest Convergence for Q-learning
Figure 4 for Fastest Convergence for Q-learning
Viaarxiv icon

Differential TD Learning for Value Function Approximation

Add code
Bookmark button
Alert button
Apr 06, 2016
Adithya M. Devraj, Sean P. Meyn

Figure 1 for Differential TD Learning for Value Function Approximation
Figure 2 for Differential TD Learning for Value Function Approximation
Figure 3 for Differential TD Learning for Value Function Approximation
Figure 4 for Differential TD Learning for Value Function Approximation
Viaarxiv icon