Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Provable Defense against Backdoor Policies in Reinforcement Learning


Nov 18, 2022
Shubham Kumar Bharti, Xuezhou Zhang, Adish Singla, Xiaojin Zhu

Add code

* Accepted at Neurips 2022 

   Access Paper or Ask Questions

Representation Learning for General-sum Low-rank Markov Games


Oct 30, 2022
Chengzhuo Ni, Yuda Song, Xuezhou Zhang, Chi Jin, Mengdi Wang

Add code


   Access Paper or Ask Questions

Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization


Jun 29, 2022
Kaixuan Huang, Yu Wu, Xuezhou Zhang, Shenyinying Tu, Qingyun Wu, Mengdi Wang, Huazheng Wang

Add code


   Access Paper or Ask Questions

Decentralized Gossip-Based Stochastic Bilevel Optimization over Communication Networks


Jun 22, 2022
Shuoguang Yang, Xuezhou Zhang, Mengdi Wang

Add code


   Access Paper or Ask Questions

Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization


Jun 05, 2022
Hui Yuan, Chengzhuo Ni, Huazheng Wang, Xuezhou Zhang, Le Cong, Csaba SzepesvΓ‘ri, Mengdi Wang

Add code


   Access Paper or Ask Questions

Byzantine-Robust Online and Offline Distributed Reinforcement Learning


Jun 01, 2022
Yiding Chen, Xuezhou Zhang, Kaiqing Zhang, Mengdi Wang, Xiaojin Zhu

Add code


   Access Paper or Ask Questions

Provable Benefits of Representational Transfer in Reinforcement Learning


May 29, 2022
Alekh Agarwal, Yuda Song, Wen Sun, Kaiwen Wang, Mengdi Wang, Xuezhou Zhang

Add code


   Access Paper or Ask Questions

Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory


Feb 10, 2022
Ruiqi Zhang, Xuezhou Zhang, Chengzhuo Ni, Mengdi Wang

Add code

* 39 pages 

   Access Paper or Ask Questions

Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach


Feb 02, 2022
Xuezhou Zhang, Yuda Song, Masatoshi Uehara, Mengdi Wang, Alekh Agarwal, Wen Sun

Add code


   Access Paper or Ask Questions

Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration


Jan 31, 2022
Chengzhuo Ni, Ruiqi Zhang, Xiang Ji, Xuezhou Zhang, Mengdi Wang

Add code


   Access Paper or Ask Questions

1
2
3
>>