
Baoxiang Wang

Convergence to Nash Equilibrium and No-regret Guarantee in (Markov) Potential Games

Apr 04, 2024
Jing Dong, Baoxiang Wang, Yaoliang Yu

Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback

Nov 14, 2023
Canzhe Zhao, Ruofeng Yang, Baoxiang Wang, Xuezhou Zhang, Shuai Li

DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning

Aug 19, 2023
Canzhe Zhao, Yanjie Ze, Jing Dong, Baoxiang Wang, Shuai Li

Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games

Jun 19, 2023
Jing Dong, Jingyu Wu, Siwei Wang, Baoxiang Wang, Wei Chen

Online Influence Maximization under Decreasing Cascade Model

May 19, 2023
Fang Kong, Jize Xie, Baoxiang Wang, Tao Yao, Shuai Li

Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning

May 18, 2023
Wenhao Li, Dan Qiao, Baoxiang Wang, Xiangfeng Wang, Bo Jin, Hongyuan Zha

Information Design in Multi-Agent Reinforcement Learning

May 08, 2023
Yue Lin, Wenhao Li, Hongyuan Zha, Baoxiang Wang

Diverse Policy Optimization for Structured Action Space

Feb 23, 2023
Wenhao Li, Baoxiang Wang, Shanchao Yang, Hongyuan Zha

Improved Regret Bounds for Linear Adversarial MDPs via Linear Optimization

Feb 14, 2023
Fang Kong, Xiangcheng Zhang, Baoxiang Wang, Shuai Li

Learning From Good Trajectories in Offline Multi-Agent Reinforcement Learning

Nov 28, 2022
Qi Tian, Kun Kuang, Furui Liu, Baoxiang Wang
