Alert button
Picture for Xuezhou Zhang

Xuezhou Zhang

Alert button

Byzantine-Robust Online and Offline Distributed Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 01, 2022
Yiding Chen, Xuezhou Zhang, Kaiqing Zhang, Mengdi Wang, Xiaojin Zhu

Viaarxiv icon

Provable Benefits of Representational Transfer in Reinforcement Learning

Add code
Bookmark button
Alert button
May 29, 2022
Alekh Agarwal, Yuda Song, Wen Sun, Kaiwen Wang, Mengdi Wang, Xuezhou Zhang

Figure 1 for Provable Benefits of Representational Transfer in Reinforcement Learning
Figure 2 for Provable Benefits of Representational Transfer in Reinforcement Learning
Figure 3 for Provable Benefits of Representational Transfer in Reinforcement Learning
Figure 4 for Provable Benefits of Representational Transfer in Reinforcement Learning
Viaarxiv icon

Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory

Add code
Bookmark button
Alert button
Feb 10, 2022
Ruiqi Zhang, Xuezhou Zhang, Chengzhuo Ni, Mengdi Wang

Figure 1 for Off-Policy Fitted Q-Evaluation with Differentiable Function Approximators: Z-Estimation and Inference Theory
Viaarxiv icon

Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach

Add code
Bookmark button
Alert button
Feb 02, 2022
Xuezhou Zhang, Yuda Song, Masatoshi Uehara, Mengdi Wang, Alekh Agarwal, Wen Sun

Figure 1 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Figure 2 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Figure 3 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Figure 4 for Efficient Reinforcement Learning in Block MDPs: A Model-free Representation Learning Approach
Viaarxiv icon

Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration

Add code
Bookmark button
Alert button
Jan 31, 2022
Chengzhuo Ni, Ruiqi Zhang, Xiang Ji, Xuezhou Zhang, Mengdi Wang

Figure 1 for Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Figure 2 for Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Figure 3 for Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Figure 4 for Optimal Estimation of Off-Policy Policy Gradient via Double Fitted Iteration
Viaarxiv icon

Representation Learning for Online and Offline RL in Low-rank MDPs

Add code
Bookmark button
Alert button
Oct 09, 2021
Masatoshi Uehara, Xuezhou Zhang, Wen Sun

Figure 1 for Representation Learning for Online and Offline RL in Low-rank MDPs
Figure 2 for Representation Learning for Online and Offline RL in Low-rank MDPs
Viaarxiv icon

Corruption-Robust Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 11, 2021
Xuezhou Zhang, Yiding Chen, Jerry Zhu, Wen Sun

Figure 1 for Corruption-Robust Offline Reinforcement Learning
Viaarxiv icon

Controllable and Diverse Text Generation in E-commerce

Add code
Bookmark button
Alert button
Feb 23, 2021
Huajie Shao, Jun Wang, Haohong Lin, Xuezhou Zhang, Aston Zhang, Heng Ji, Tarek Abdelzaher

Figure 1 for Controllable and Diverse Text Generation in E-commerce
Figure 2 for Controllable and Diverse Text Generation in E-commerce
Figure 3 for Controllable and Diverse Text Generation in E-commerce
Figure 4 for Controllable and Diverse Text Generation in E-commerce
Viaarxiv icon

Reward Poisoning in Reinforcement Learning: Attacks Against Unknown Learners in Unknown Environments

Add code
Bookmark button
Alert button
Feb 16, 2021
Amin Rakhsha, Xuezhou Zhang, Xiaojin Zhu, Adish Singla

Viaarxiv icon

Robust Policy Gradient against Strong Data Corruption

Add code
Bookmark button
Alert button
Feb 16, 2021
Xuezhou Zhang, Yiding Chen, Xiaojin Zhu, Wen Sun

Figure 1 for Robust Policy Gradient against Strong Data Corruption
Figure 2 for Robust Policy Gradient against Strong Data Corruption
Figure 3 for Robust Policy Gradient against Strong Data Corruption
Figure 4 for Robust Policy Gradient against Strong Data Corruption
Viaarxiv icon