Alert button
Picture for Kaiqing Zhang

Kaiqing Zhang

Alert button

Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games

Dec 08, 2023
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman Ozdaglar, Adam Wierman

Viaarxiv icon

Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Oct 02, 2023
Lirui Wang, Kaiqing Zhang, Allan Zhou, Max Simchowitz, Russ Tedrake

Viaarxiv icon

Partially Observable Multi-agent RL with (Quasi-)Efficiency: The Blessing of Information Sharing

Aug 16, 2023
Xiangyu Liu, Kaiqing Zhang

Figure 1 for Partially Observable Multi-agent RL with (Quasi-)Efficiency: The Blessing of Information Sharing
Figure 2 for Partially Observable Multi-agent RL with (Quasi-)Efficiency: The Blessing of Information Sharing
Figure 3 for Partially Observable Multi-agent RL with (Quasi-)Efficiency: The Blessing of Information Sharing
Viaarxiv icon

Tackling Combinatorial Distribution Shift: A Matrix Completion Perspective

Jul 28, 2023
Max Simchowitz, Abhishek Gupta, Kaiqing Zhang

Figure 1 for Tackling Combinatorial Distribution Shift: A Matrix Completion Perspective
Figure 2 for Tackling Combinatorial Distribution Shift: A Matrix Completion Perspective
Viaarxiv icon

Multi-Player Zero-Sum Markov Games with Networked Separable Interactions

Jul 13, 2023
Chanwoo Park, Kaiqing Zhang, Asuman Ozdaglar

Figure 1 for Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
Figure 2 for Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
Figure 3 for Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
Figure 4 for Multi-Player Zero-Sum Markov Games with Networked Separable Interactions
Viaarxiv icon

Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs

Jun 20, 2023
Dongsheng Ding, Chen-Yu Wei, Kaiqing Zhang, Alejandro Ribeiro

Figure 1 for Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Figure 2 for Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Figure 3 for Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Figure 4 for Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs
Viaarxiv icon

Self-Supervised Reinforcement Learning that Transfers using Random Features

May 26, 2023
Boyuan Chen, Chuning Zhu, Pulkit Agrawal, Kaiqing Zhang, Abhishek Gupta

Figure 1 for Self-Supervised Reinforcement Learning that Transfers using Random Features
Figure 2 for Self-Supervised Reinforcement Learning that Transfers using Random Features
Figure 3 for Self-Supervised Reinforcement Learning that Transfers using Random Features
Figure 4 for Self-Supervised Reinforcement Learning that Transfers using Random Features
Viaarxiv icon

Learning to Extrapolate: A Transductive Approach

Apr 27, 2023
Aviv Netanyahu, Abhishek Gupta, Max Simchowitz, Kaiqing Zhang, Pulkit Agrawal

Figure 1 for Learning to Extrapolate: A Transductive Approach
Figure 2 for Learning to Extrapolate: A Transductive Approach
Figure 3 for Learning to Extrapolate: A Transductive Approach
Figure 4 for Learning to Extrapolate: A Transductive Approach
Viaarxiv icon