Picture for Chi Jin

Chi Jin

Provable Self-Play Algorithms for Competitive Reinforcement Learning

Add code
Feb 23, 2020
Figure 1 for Provable Self-Play Algorithms for Competitive Reinforcement Learning
Viaarxiv icon

Reward-Free Exploration for Reinforcement Learning

Add code
Feb 07, 2020
Figure 1 for Reward-Free Exploration for Reinforcement Learning
Figure 2 for Reward-Free Exploration for Reinforcement Learning
Figure 3 for Reward-Free Exploration for Reinforcement Learning
Figure 4 for Reward-Free Exploration for Reinforcement Learning
Viaarxiv icon

Near-Optimal Algorithms for Minimax Optimization

Add code
Feb 05, 2020
Figure 1 for Near-Optimal Algorithms for Minimax Optimization
Figure 2 for Near-Optimal Algorithms for Minimax Optimization
Viaarxiv icon

Learning Adversarial MDPs with Bandit Feedback and Unknown Transition

Add code
Jan 07, 2020
Viaarxiv icon

Provably Efficient Exploration in Policy Optimization

Add code
Dec 12, 2019
Viaarxiv icon

Provably Efficient Reinforcement Learning with Linear Function Approximation

Add code
Aug 08, 2019
Viaarxiv icon

On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems

Add code
Jun 02, 2019
Figure 1 for On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems
Viaarxiv icon

Stochastic Gradient Descent Escapes Saddle Points Efficiently

Add code
Feb 13, 2019
Figure 1 for Stochastic Gradient Descent Escapes Saddle Points Efficiently
Figure 2 for Stochastic Gradient Descent Escapes Saddle Points Efficiently
Figure 3 for Stochastic Gradient Descent Escapes Saddle Points Efficiently
Figure 4 for Stochastic Gradient Descent Escapes Saddle Points Efficiently
Viaarxiv icon

A Short Note on Concentration Inequalities for Random Vectors with SubGaussian Norm

Add code
Feb 11, 2019
Viaarxiv icon

Minmax Optimization: Stable Limit Points of Gradient Descent Ascent are Locally Optimal

Add code
Feb 02, 2019
Figure 1 for Minmax Optimization: Stable Limit Points of Gradient Descent Ascent are Locally Optimal
Figure 2 for Minmax Optimization: Stable Limit Points of Gradient Descent Ascent are Locally Optimal
Viaarxiv icon