Yuejie Chi

Faster Last-iterate Convergence of Policy Optimization in Zero-Sum Markov Games

Oct 04, 2022
Shicong Cen, Yuejie Chi, Simon S. Du, Lin Xiao

Via arXiv

Minimax-Optimal Multi-Agent RL in Zero-Sum Markov Games With a Generative Model

Aug 22, 2022
Gen Li, Yuejie Chi, Yuting Wei, Yuxin Chen

Via arXiv

Local Geometry of Nonconvex Spike Deconvolution from Low-Pass Measurements

Aug 22, 2022
Maxime Ferreira Da Costa, Yuejie Chi

Via arXiv

Distributionally Robust Model-Based Offline Reinforcement Learning with Near-Optimal Sample Complexity

Aug 11, 2022
Laixi Shi, Yuejie Chi

Via arXiv

SoteriaFL: A Unified Framework for Private Federated Learning with Communication Compression

Jun 20, 2022
Zhize Li, Haoyu Zhao, Boyue Li, Yuejie Chi

Via arXiv

Fast and Provable Tensor Robust Principal Component Analysis via Scaled Gradient Descent

Jun 18, 2022
Harry Dong, Tian Tong, Cong Ma, Yuejie Chi

Via arXiv

Independent Natural Policy Gradient Methods for Potential Games: Finite-time Global Convergence with Entropy Regularization

Apr 12, 2022
Shicong Cen, Fan Chen, Yuejie Chi

Via arXiv

Settling the Sample Complexity of Model-Based Offline Reinforcement Learning

Apr 11, 2022
Gen Li, Laixi Shi, Yuxin Chen, Yuejie Chi, Yuting Wei

Via arXiv

Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity

Feb 28, 2022
Laixi Shi, Gen Li, Yuting Wei, Yuxin Chen, Yuejie Chi

Via arXiv

BEER: Fast $O(1/T)$ Rate for Decentralized Nonconvex Optimization with Communication Compression

Jan 31, 2022
Haoyu Zhao, Boyue Li, Zhize Li, Peter Richtárik, Yuejie Chi

Via arXiv