Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments


Jul 19, 2022
JB Lanier, Stephen McAleer, Pierre Baldi, Roy Fox


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games


Jul 13, 2022
Stephen McAleer, JB Lanier, Kevin Wang, Pierre Baldi, Roy Fox, Tuomas Sandholm


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Learning to Query Internet Text for Informing Reinforcement Learning Agents


May 25, 2022
Kolby Nottingham, Alekhya Pyla, Sameer Singh, Roy Fox


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Anytime PSRO for Two-Player Zero-Sum Games


Jan 28, 2022
Stephen McAleer, Kevin Wang, John Lanier, Marc Lanctot, Pierre Baldi, Tuomas Sandholm, Roy Fox

* Published in AAAI Reinforcement Learning in Games Workshop 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Anytime Optimal PSRO for Two-Player Zero-Sum Games


Jan 19, 2022
Stephen McAleer, Kevin Wang, Marc Lanctot, John Lanier, Pierre Baldi, Roy Fox

* Published in AAAI Reinforcement Learning in Games Workshop 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Target Entropy Annealing for Discrete Soft Actor-Critic


Dec 06, 2021
Yaosheng Xu, Dailin Hu, Litian Liang, Stephen McAleer, Pieter Abbeel, Roy Fox

* neurips 2021 deep rl workshop 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Count-Based Temperature Scheduling for Maximum Entropy Reinforcement Learning


Nov 28, 2021
Dailin Hu, Pieter Abbeel, Roy Fox


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Temporal-Difference Value Estimation via Uncertainty-Guided Soft Updates


Oct 28, 2021
Litian Liang, Yaosheng Xu, Stephen McAleer, Dailin Hu, Alexander Ihler, Pieter Abbeel, Roy Fox

* Accepted to Deep Reinforcement Learning Workshop @ NeurIPS 2021 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Independent Natural Policy Gradient Always Converges in Markov Potential Games


Oct 20, 2021
Roy Fox, Stephen McAleer, Will Overman, Ioannis Panageas

* 24 pages 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Modular Framework for Visuomotor Language Grounding


Sep 05, 2021
Kolby Nottingham, Litian Liang, Daeyun Shin, Charless C. Fowlkes, Roy Fox, Sameer Singh


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
>>