Shangtong Zhang

Truncated Emphatic Temporal Difference Methods for Prediction and Control


Aug 11, 2021
Shangtong Zhang, Shimon Whiteson



Learning Expected Emphatic Traces for Deep RL


Jul 12, 2021
Ray Jiang, Shangtong Zhang, Veronica Chelu, Adam White, Hado van Hasselt



Breaking the Deadly Triad with a Target Network


Feb 09, 2021
Shangtong Zhang, Hengshuai Yao, Shimon Whiteson



Average-Reward Off-Policy Policy Evaluation with Function Approximation


Jan 08, 2021
Shangtong Zhang, Yi Wan, Richard S. Sutton, Shimon Whiteson



A Deeper Look at Discounting Mismatch in Actor-Critic Algorithms


Oct 02, 2020
Shangtong Zhang, Romain Laroche, Harm van Seijen, Shimon Whiteson, Remi Tachet des Combes



Learning Retrospective Knowledge with Reverse Reinforcement Learning


Jul 09, 2020
Shangtong Zhang, Vivek Veeriah, Shimon Whiteson



Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning


May 27, 2020
Shangtong Zhang, Bo Liu, Shimon Whiteson



Per-Step Reward: A New Perspective for Risk-Averse Reinforcement Learning


Apr 22, 2020
Shangtong Zhang, Bo Liu, Shimon Whiteson



GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values


Feb 07, 2020
Shangtong Zhang, Bo Liu, Shimon Whiteson



Provably Convergent Off-Policy Actor-Critic with Function Approximation


Nov 11, 2019
Shangtong Zhang, Bo Liu, Hengshuai Yao, Shimon Whiteson

* Optimization Foundations of Reinforcement Learning Workshop at NeurIPS 2019 


Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards


May 30, 2019
Yuhang Song, Jianyi Wang, Thomas Lukasiewicz, Zhenghua Xu, Shangtong Zhang, Mai Xu



DAC: The Double Actor-Critic Architecture for Learning Options


May 16, 2019
Shangtong Zhang, Shimon Whiteson



Distributional Reinforcement Learning for Efficient Exploration


May 13, 2019
Borislav Mavrin, Shangtong Zhang, Hengshuai Yao, Linglong Kong, Kaiwen Wu, Yaoliang Yu

* ICML, 2019 


Deep Residual Reinforcement Learning


May 03, 2019
Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson



Generalized Off-Policy Actor-Critic


Mar 27, 2019
Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson



QUOTA: The Quantile Option Architecture for Reinforcement Learning


Nov 07, 2018
Shangtong Zhang, Borislav Mavrin, Linglong Kong, Bo Liu, Hengshuai Yao

* AAAI 2019 


ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search


Nov 06, 2018
Shangtong Zhang, Hao Chen, Hengshuai Yao

* AAAI 2019 


A Deeper Look at Experience Replay


Apr 30, 2018
Shangtong Zhang, Richard S. Sutton

* NIPS 2017 Deep Reinforcement Learning Symposium 


Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control


Mar 07, 2018
Shangtong Zhang, Osmar R. Zaiane

* NIPS 2017 Deep Reinforcement Learning Symposium 


Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks


Apr 27, 2017
Vivek Veeriah, Shangtong Zhang, Richard S. Sutton

