Understanding Decision-Time vs. Background Planning in Model-Based Reinforcement Learning


Jun 16, 2022
Safa Alver, Doina Precup



Improving Robustness against Real-World and Worst-Case Distribution Shifts through Decision Region Quantification


May 19, 2022
Leo Schwinn, Leon Bungert, An Nguyen, René Raab, Falk Pulsmeyer, Doina Precup, Björn Eskofier, Dario Zanca



Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning


Apr 21, 2022
Gheorghe Comanici, Amelia Glaese, Anita Gergely, Daniel Toyama, Zafarali Ahmed, Tyler Jackson, Philippe Hamel, Doina Precup



Behind the Machine's Gaze: Biologically Constrained Neural Networks Exhibit Human-like Visual Attention


Apr 19, 2022
Leo Schwinn, Doina Precup, Björn Eskofier, Dario Zanca



COptiDICE: Offline Constrained Reinforcement Learning via Stationary Distribution Correction Estimation


Apr 19, 2022
Jongmin Lee, Cosmin Paduraru, Daniel J. Mankowitz, Nicolas Heess, Doina Precup, Kee-Eung Kim, Arthur Guez

* 24 pages, 6 figures, accepted at ICLR 2022 (spotlight)


Towards Painless Policy Optimization for Constrained MDPs


Apr 11, 2022
Arushi Jain, Sharan Vaswani, Reza Babanezhad, Csaba Szepesvari, Doina Precup

* Paper under submission. 27 pages, 12 figures 


Selective Credit Assignment


Feb 20, 2022
Veronica Chelu, Diana Borsa, Doina Precup, Hado van Hasselt



Improving Sample Efficiency of Value Based Models Using Attention and Vision Transformers


Feb 01, 2022
Amir Ardalan Kalantari, Mohammad Amini, Sarath Chandar, Doina Precup



Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error


Jan 28, 2022
Scott Fujimoto, David Meger, Doina Precup, Ofir Nachum, Shixiang Shane Gu



The Paradox of Choice: Using Attention in Hierarchical Reinforcement Learning


Jan 24, 2022
Andrei Nica, Khimya Khetarpal, Doina Precup

* 20 pages, 15 figures 
