Baoxiang Wang

Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback

Nov 14, 2023
Canzhe Zhao, Ruofeng Yang, Baoxiang Wang, Xuezhou Zhang, Shuai Li

Via arXiv

DPMAC: Differentially Private Communication for Cooperative Multi-Agent Reinforcement Learning

Aug 19, 2023
Canzhe Zhao, Yanjie Ze, Jing Dong, Baoxiang Wang, Shuai Li

Via arXiv

Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games

Jun 19, 2023
Jing Dong, Jingyu Wu, Siwei Wang, Baoxiang Wang, Wei Chen

Via arXiv

Online Influence Maximization under Decreasing Cascade Model

May 19, 2023
Fang Kong, Jize Xie, Baoxiang Wang, Tao Yao, Shuai Li

Via arXiv

Semantically Aligned Task Decomposition in Multi-Agent Reinforcement Learning

May 18, 2023
Wenhao Li, Dan Qiao, Baoxiang Wang, Xiangfeng Wang, Bo Jin, Hongyuan Zha

Via arXiv

Information Design in Multi-Agent Reinforcement Learning

May 08, 2023
Yue Lin, Wenhao Li, Hongyuan Zha, Baoxiang Wang

Via arXiv

Diverse Policy Optimization for Structured Action Space

Feb 23, 2023
Wenhao Li, Baoxiang Wang, Shanchao Yang, Hongyuan Zha

Via arXiv

Improved Regret Bounds for Linear Adversarial MDPs via Linear Optimization

Feb 14, 2023
Fang Kong, Xiangcheng Zhang, Baoxiang Wang, Shuai Li

Via arXiv

Learning From Good Trajectories in Offline Multi-Agent Reinforcement Learning

Nov 28, 2022
Qi Tian, Kun Kuang, Furui Liu, Baoxiang Wang

Via arXiv

Online Policy Optimization for Robust MDP

Sep 28, 2022
Jing Dong, Jingwei Li, Baoxiang Wang, Jingzhao Zhang

Via arXiv