Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Online Sparse Reinforcement Learning


Nov 08, 2020
Botao Hao, Tor Lattimore, Csaba Szepesvári, Mengdi Wang

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Mirror Descent and the Information Ratio


Sep 25, 2020
Tor Lattimore, András György

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Improved Regret for Zeroth-Order Adversarial Bandit Convex Optimisation


Jun 19, 2020
Tor Lattimore

Add code

* 20 pages, 5 figures. Bound is now improved by d^{1/2} 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Gaussian Gated Linear Networks


Jun 10, 2020
David Budden, Adam Marblestone, Eren Sezener, Tor Lattimore, Greg Wayne, Joel Veness

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Stochastic matrix games with bandit feedback


Jun 09, 2020
Brendan O'Donoghue, Tor Lattimore, Ian Osband

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Model Selection in Contextual Stochastic Bandit Problems


Mar 03, 2020
Aldo Pacchiano, My Phan, Yasin Abbasi-Yadkori, Anup Rao, Julian Zimmert, Tor Lattimore, Csaba Szepesvari

Add code

* 12 main pages, 2 figures, 14 appendix pages 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Information Directed Sampling for Linear Partial Monitoring


Feb 25, 2020
Johannes Kirschner, Tor Lattimore, Andreas Krause

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Learning with Good Feature Representations in Bandits and in RL with a Generative Model


Nov 18, 2019
Tor Lattimore, Csaba Szepesvari

Add code

* 11 pages 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Adaptive Exploration in Linear Contextual Bandit


Oct 15, 2019
Botao Hao, Tor Lattimore, Csaba Szepesvari

Add code


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
<<
1
2
3
4
5
6
7
8
>>