Alert button
Picture for Pan Xu

Pan Xu

Alert button

Minimax Optimal and Computationally Efficient Algorithms for Distributionally Robust Offline Reinforcement Learning

Mar 14, 2024
Zhishuai Liu, Pan Xu

Viaarxiv icon

Distributionally Robust Off-Dynamics Reinforcement Learning: Provable Efficiency with Linear Function Approximation

Feb 23, 2024
Zhishuai Liu, Pan Xu

Viaarxiv icon

Finite-Time Frequentist Regret Bounds of Multi-Agent Thompson Sampling on Sparse Hypergraphs

Dec 24, 2023
Tianyuan Jin, Hao-Lun Hsu, William Chang, Pan Xu

Viaarxiv icon

Convergence of Sign-based Random Reshuffling Algorithms for Nonconvex Optimization

Oct 24, 2023
Zhen Qin, Zhishuai Liu, Pan Xu

Viaarxiv icon

Optimal Batched Best Arm Identification

Oct 21, 2023
Tianyuan Jin, Yu Yang, Jing Tang, Xiaokui Xiao, Pan Xu

Figure 1 for Optimal Batched Best Arm Identification
Figure 2 for Optimal Batched Best Arm Identification
Figure 3 for Optimal Batched Best Arm Identification
Figure 4 for Optimal Batched Best Arm Identification
Viaarxiv icon

Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits

Sep 19, 2023
Yi Shen, Pan Xu, Michael M. Zavlanos

Figure 1 for Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits
Figure 2 for Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits
Figure 3 for Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits
Figure 4 for Wasserstein Distributionally Robust Policy Evaluation and Learning for Contextual Bandits
Viaarxiv icon

Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo

May 29, 2023
Haque Ishfaq, Qingfeng Lan, Pan Xu, A. Rupam Mahmood, Doina Precup, Anima Anandkumar, Kamyar Azizzadenesheli

Figure 1 for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Figure 2 for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Figure 3 for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Figure 4 for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Viaarxiv icon

Queer In AI: A Case Study in Community-Led Participatory AI

Apr 10, 2023
Organizers Of Queer in AI, Anaelia Ovalle, Arjun Subramonian, Ashwin Singh, Claas Voelcker, Danica J. Sutherland, Davide Locatelli, Eva Breznik, Filip Klubička, Hang Yuan, Hetvi J, Huan Zhang, Jaidev Shriram, Kruno Lehman, Luca Soldaini, Maarten Sap, Marc Peter Deisenroth, Maria Leonor Pacheco, Maria Ryskina, Martin Mundt, Milind Agarwal, Nyx McLean, Pan Xu, A Pranav, Raj Korpan, Ruchira Ray, Sarah Mathew, Sarthak Arora, St John, Tanvi Anand, Vishakha Agrawal, William Agnew, Yanan Long, Zijie J. Wang, Zeerak Talat, Avijit Ghosh, Nathaniel Dennler, Michael Noseworthy, Sharvani Jha, Emi Baylor, Aditya Joshi, Natalia Y. Bilenko, Andrew McNamara, Raphael Gontijo-Lopes, Alex Markham, Evyn Dǒng, Jackie Kay, Manu Saraswat, Nikhil Vytla, Luke Stark

Figure 1 for Queer In AI: A Case Study in Community-Led Participatory AI
Figure 2 for Queer In AI: A Case Study in Community-Led Participatory AI
Figure 3 for Queer In AI: A Case Study in Community-Led Participatory AI
Figure 4 for Queer In AI: A Case Study in Community-Led Participatory AI
Viaarxiv icon

Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning

Nov 30, 2022
Yizhou Zhang, Guannan Qu, Pan Xu, Yiheng Lin, Zaiwei Chen, Adam Wierman

Figure 1 for Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning
Figure 2 for Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning
Figure 3 for Global Convergence of Localized Policy Iteration in Networked Multi-Agent Reinforcement Learning
Viaarxiv icon