Alert button
Picture for Tor Lattimore

Tor Lattimore

Alert button

Geometric Entropic Exploration

Add code
Bookmark button
Alert button
Jan 06, 2021
Zhaohan Daniel Guo, Mohammad Gheshlagi Azar, Alaa Saade, Shantanu Thakoor, Bilal Piot, Bernardo Avila Pires, Michal Valko, Thomas Mesnard, Tor Lattimore, Rémi Munos

Figure 1 for Geometric Entropic Exploration
Figure 2 for Geometric Entropic Exploration
Figure 3 for Geometric Entropic Exploration
Figure 4 for Geometric Entropic Exploration
Viaarxiv icon

Asymptotically Optimal Information-Directed Sampling

Add code
Bookmark button
Alert button
Nov 11, 2020
Johannes Kirschner, Tor Lattimore, Claire Vernade, Csaba Szepesvári

Figure 1 for Asymptotically Optimal Information-Directed Sampling
Figure 2 for Asymptotically Optimal Information-Directed Sampling
Figure 3 for Asymptotically Optimal Information-Directed Sampling
Figure 4 for Asymptotically Optimal Information-Directed Sampling
Viaarxiv icon

High-Dimensional Sparse Linear Bandits

Add code
Bookmark button
Alert button
Nov 08, 2020
Botao Hao, Tor Lattimore, Mengdi Wang

Figure 1 for High-Dimensional Sparse Linear Bandits
Figure 2 for High-Dimensional Sparse Linear Bandits
Viaarxiv icon

Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient

Add code
Bookmark button
Alert button
Nov 08, 2020
Botao Hao, Yaqi Duan, Tor Lattimore, Csaba Szepesvári, Mengdi Wang

Viaarxiv icon

Online Sparse Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 08, 2020
Botao Hao, Tor Lattimore, Csaba Szepesvári, Mengdi Wang

Figure 1 for Online Sparse Reinforcement Learning
Viaarxiv icon

Mirror Descent and the Information Ratio

Add code
Bookmark button
Alert button
Sep 25, 2020
Tor Lattimore, András György

Figure 1 for Mirror Descent and the Information Ratio
Viaarxiv icon

Improved Regret for Zeroth-Order Adversarial Bandit Convex Optimisation

Add code
Bookmark button
Alert button
Jun 19, 2020
Tor Lattimore

Figure 1 for Improved Regret for Zeroth-Order Adversarial Bandit Convex Optimisation
Figure 2 for Improved Regret for Zeroth-Order Adversarial Bandit Convex Optimisation
Figure 3 for Improved Regret for Zeroth-Order Adversarial Bandit Convex Optimisation
Figure 4 for Improved Regret for Zeroth-Order Adversarial Bandit Convex Optimisation
Viaarxiv icon

Gaussian Gated Linear Networks

Add code
Bookmark button
Alert button
Jun 10, 2020
David Budden, Adam Marblestone, Eren Sezener, Tor Lattimore, Greg Wayne, Joel Veness

Figure 1 for Gaussian Gated Linear Networks
Figure 2 for Gaussian Gated Linear Networks
Figure 3 for Gaussian Gated Linear Networks
Figure 4 for Gaussian Gated Linear Networks
Viaarxiv icon

Stochastic matrix games with bandit feedback

Add code
Bookmark button
Alert button
Jun 09, 2020
Brendan O'Donoghue, Tor Lattimore, Ian Osband

Figure 1 for Stochastic matrix games with bandit feedback
Figure 2 for Stochastic matrix games with bandit feedback
Figure 3 for Stochastic matrix games with bandit feedback
Figure 4 for Stochastic matrix games with bandit feedback
Viaarxiv icon