Kaiqing Zhang

Do LLM Agents Have Regret? A Case Study in Online Learning and Games

Mar 25, 2024
Chanwoo Park, Xiangyu Liu, Asuman Ozdaglar, Kaiqing Zhang

Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games

Dec 08, 2023
Zaiwei Chen, Kaiqing Zhang, Eric Mazumdar, Asuman Ozdaglar, Adam Wierman

Fleet Policy Learning via Weight Merging and An Application to Robotic Tool-Use

Oct 02, 2023
Lirui Wang, Kaiqing Zhang, Allan Zhou, Max Simchowitz, Russ Tedrake

Partially Observable Multi-agent RL with (Quasi-)Efficiency: The Blessing of Information Sharing

Aug 16, 2023
Xiangyu Liu, Kaiqing Zhang

Tackling Combinatorial Distribution Shift: A Matrix Completion Perspective

Jul 28, 2023
Max Simchowitz, Abhishek Gupta, Kaiqing Zhang

Multi-Player Zero-Sum Markov Games with Networked Separable Interactions

Jul 13, 2023
Chanwoo Park, Kaiqing Zhang, Asuman Ozdaglar

Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs

Jun 20, 2023
Dongsheng Ding, Chen-Yu Wei, Kaiqing Zhang, Alejandro Ribeiro

Self-Supervised Reinforcement Learning that Transfers using Random Features

May 26, 2023
Boyuan Chen, Chuning Zhu, Pulkit Agrawal, Kaiqing Zhang, Abhishek Gupta
