Towards Flexible Inference in Sequential Decision Problems via Bidirectional Transformers


Apr 28, 2022
Micah Carroll, Jessy Lin, Orr Paradise, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin



Estimating and Penalizing Induced Preference Shifts in Recommender Systems


Apr 25, 2022
Micah Carroll, Dylan Hadfield-Menell, Stuart Russell, Anca Dragan



The Boltzmann Policy Distribution: Accounting for Systematic Suboptimality in Human Models


Apr 22, 2022
Cassidy Laidlaw, Anca Dragan

* Published at ICLR 2022 


Inferring Rewards from Language in Context


Apr 05, 2022
Jessy Lin, Daniel Fried, Dan Klein, Anca Dragan

* ACL 2022. Code and dataset: https://github.com/jlin816/rewards-from-language 


Human irrationality: both bad and good for reward inference


Nov 12, 2021
Lawrence Chan, Andrew Critch, Anca Dragan

* 12 pages, 10 figures 


B-Pref: Benchmarking Preference-Based Reinforcement Learning


Nov 04, 2021
Kimin Lee, Laura Smith, Anca Dragan, Pieter Abbeel

* NeurIPS Datasets and Benchmarks Track 2021. Code is available at https://github.com/rll-research/B-Pref 


The MineRL BASALT Competition on Learning from Human Feedback


Jul 05, 2021
Rohin Shah, Cody Wild, Steven H. Wang, Neel Alex, Brandon Houghton, William Guss, Sharada Mohanty, Anssi Kanervisto, Stephanie Milani, Nicholay Topin, Pieter Abbeel, Stuart Russell, Anca Dragan

* NeurIPS 2021 Competition Track 


Learning What To Do by Simulating the Past


May 03, 2021
David Lindner, Rohin Shah, Pieter Abbeel, Anca Dragan

* Presented at ICLR 2021 


Choice Set Misspecification in Reward Inference


Jan 19, 2021
Rachel Freedman, Rohin Shah, Anca Dragan

* Presented at the IJCAI-PRICAI 2020 Workshop on Artificial Intelligence Safety 
