Multi-Task Off-Policy Learning from Bandit Feedback


Dec 09, 2022
Joey Hong, Branislav Kveton, Sumeet Katariya, Manzil Zaheer, Mohammad Ghavamzadeh

* 14 pages, 3 figures 

On the Sensitivity of Reward Inference to Misspecified Human Models


Dec 09, 2022
Joey Hong, Kush Bhatia, Anca Dragan

* 17 pages, 12 figures 

Confidence-Conditioned Value Functions for Offline Reinforcement Learning


Dec 08, 2022
Joey Hong, Aviral Kumar, Sergey Levine

* 16 pages, NeurIPS 2022 DeepRL Workshop 

When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?


Apr 12, 2022
Aviral Kumar, Joey Hong, Anikait Singh, Sergey Levine

* ICLR 2022. First two authors contributed equally 

Compositional Generalization and Decomposition in Neural Program Synthesis


Apr 07, 2022
Kensen Shi, Joey Hong, Manzil Zaheer, Pengcheng Yin, Charles Sutton

* Published at the Deep Learning for Code (DL4C) Workshop at ICLR 2022 

Deep Hierarchy in Bandits


Feb 03, 2022
Joey Hong, Branislav Kveton, Sumeet Katariya, Manzil Zaheer, Mohammad Ghavamzadeh

Hierarchical Bayesian Bandits


Nov 12, 2021
Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh

* 21 pages, 3 figures 

Thompson Sampling with a Mixture Prior


Jun 10, 2021
Joey Hong, Branislav Kveton, Manzil Zaheer, Mohammad Ghavamzadeh, Craig Boutilier

* 22 pages, 3 figures 

Non-Stationary Latent Bandits


Dec 01, 2020
Joey Hong, Branislav Kveton, Manzil Zaheer, Yinlam Chow, Amr Ahmed, Mohammad Ghavamzadeh, Craig Boutilier

* 15 pages, 4 figures 

Latent Programmer: Discrete Latent Codes for Program Synthesis


Dec 01, 2020
Joey Hong, David Dohan, Rishabh Singh, Charles Sutton, Manzil Zaheer

* 15 pages, 9 figures 
