Picture for Yuejie Chi

Yuejie Chi

Independent Natural Policy Gradient Methods for Potential Games: Finite-time Global Convergence with Entropy Regularization

Add code
Apr 12, 2022
Figure 1 for Independent Natural Policy Gradient Methods for Potential Games: Finite-time Global Convergence with Entropy Regularization
Figure 2 for Independent Natural Policy Gradient Methods for Potential Games: Finite-time Global Convergence with Entropy Regularization
Figure 3 for Independent Natural Policy Gradient Methods for Potential Games: Finite-time Global Convergence with Entropy Regularization
Viaarxiv icon

Settling the Sample Complexity of Model-Based Offline Reinforcement Learning

Add code
Apr 11, 2022
Figure 1 for Settling the Sample Complexity of Model-Based Offline Reinforcement Learning
Figure 2 for Settling the Sample Complexity of Model-Based Offline Reinforcement Learning
Viaarxiv icon

Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity

Add code
Feb 28, 2022
Figure 1 for Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity
Figure 2 for Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity
Viaarxiv icon

BEER: Fast $O$ Rate for Decentralized Nonconvex Optimization with Communication Compression

Add code
Jan 31, 2022
Figure 1 for BEER: Fast $O$ Rate for Decentralized Nonconvex Optimization with Communication Compression
Figure 2 for BEER: Fast $O$ Rate for Decentralized Nonconvex Optimization with Communication Compression
Figure 3 for BEER: Fast $O$ Rate for Decentralized Nonconvex Optimization with Communication Compression
Figure 4 for BEER: Fast $O$ Rate for Decentralized Nonconvex Optimization with Communication Compression
Viaarxiv icon

Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning

Add code
Oct 09, 2021
Figure 1 for Breaking the Sample Complexity Barrier to Regret-Optimal Model-Free Reinforcement Learning
Viaarxiv icon

DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization

Add code
Oct 04, 2021
Figure 1 for DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization
Figure 2 for DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization
Figure 3 for DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization
Figure 4 for DESTRESS: Computation-Optimal and Communication-Efficient Decentralized Nonconvex Finite-Sum Optimization
Viaarxiv icon

Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization

Add code
May 31, 2021
Figure 1 for Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization
Figure 2 for Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization
Figure 3 for Fast Policy Extragradient Methods for Competitive Games with Entropy Regularization
Viaarxiv icon

Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence

Add code
May 24, 2021
Figure 1 for Policy Mirror Descent for Regularized Reinforcement Learning: A Generalized Framework with Linear Convergence
Viaarxiv icon

Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting

Add code
May 17, 2021
Figure 1 for Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting
Viaarxiv icon

Scaling and Scalability: Provable Nonconvex Low-Rank Tensor Estimation from Incomplete Measurements

Add code
Apr 29, 2021
Figure 1 for Scaling and Scalability: Provable Nonconvex Low-Rank Tensor Estimation from Incomplete Measurements
Figure 2 for Scaling and Scalability: Provable Nonconvex Low-Rank Tensor Estimation from Incomplete Measurements
Figure 3 for Scaling and Scalability: Provable Nonconvex Low-Rank Tensor Estimation from Incomplete Measurements
Figure 4 for Scaling and Scalability: Provable Nonconvex Low-Rank Tensor Estimation from Incomplete Measurements
Viaarxiv icon