Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning


Jul 15, 2022
Yunhao Tang , Mark Rowland , Rémi Munos , Bernardo Ávila Pires , Will Dabney , Marc G. Bellemare


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

BYOL-Explore: Exploration by Bootstrapped Prediction


Jun 16, 2022
Zhaohan Daniel Guo , Shantanu Thakoor , Miruna Pîslar , Bernardo Avila Pires , Florent Altché , Corentin Tallec , Alaa Saade , Daniele Calandriello , Jean-Bastien Grill , Yunhao Tang , Michal Valko , Rémi Munos , Mohammad Gheshlaghi Azar , Bilal Piot


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal


May 27, 2022
Tadashi Kozuno , Wenhao Yang , Nino Vieillard , Toshinori Kitamura , Yunhao Tang , Jincheng Mei , Pierre Ménard , Mohammad Gheshlaghi Azar , Michal Valko , Rémi Munos , Olivier Pietquin , Matthieu Geist , Csaba Szepesvári

* 29 pages, 6 figures 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

From Dirichlet to Rubin: Optimistic Exploration in RL without Bonuses


May 16, 2022
Daniil Tiapkin , Denis Belomestny , Eric Moulines , Alexey Naumov , Sergey Samsonov , Yunhao Tang , Michal Valko , Pierre Menard


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Marginalized Operators for Off-policy Reinforcement Learning


Mar 30, 2022
Yunhao Tang , Mark Rowland , Rémi Munos , Michal Valko

* Accepted at AISTATS 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Biased Gradient Estimate with Drastic Variance Reduction for Meta Reinforcement Learning


Dec 14, 2021
Yunhao Tang


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Unifying Gradient Estimators for Meta-Reinforcement Learning via Off-Policy Evaluation


Jun 24, 2021
Yunhao Tang , Tadashi Kozuno , Mark Rowland , Rémi Munos , Michal Valko


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Taylor Expansion of Discount Factors


Jun 14, 2021
Yunhao Tang , Mark Rowland , Rémi Munos , Michal Valko

* Accepted at International Conference of Machine Learning (ICML), 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Unlocking Pixels for Reinforcement Learning via Implicit Attention


Mar 04, 2021
Krzysztof Choromanski , Deepali Jain , Jack Parker-Holder , Xingyou Song , Valerii Likhosherstov , Anirban Santara , Aldo Pacchiano , Yunhao Tang , Adrian Weller


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
4
>>