Picture for Tor Lattimore

Tor Lattimore

Sparse Feature Selection Makes Batch Reinforcement Learning More Sample Efficient

Add code
Nov 08, 2020
Viaarxiv icon

Online Sparse Reinforcement Learning

Add code
Nov 08, 2020
Figure 1 for Online Sparse Reinforcement Learning
Viaarxiv icon

Mirror Descent and the Information Ratio

Add code
Sep 25, 2020
Figure 1 for Mirror Descent and the Information Ratio
Viaarxiv icon

Improved Regret for Zeroth-Order Adversarial Bandit Convex Optimisation

Add code
Jun 19, 2020
Figure 1 for Improved Regret for Zeroth-Order Adversarial Bandit Convex Optimisation
Figure 2 for Improved Regret for Zeroth-Order Adversarial Bandit Convex Optimisation
Figure 3 for Improved Regret for Zeroth-Order Adversarial Bandit Convex Optimisation
Figure 4 for Improved Regret for Zeroth-Order Adversarial Bandit Convex Optimisation
Viaarxiv icon

Gaussian Gated Linear Networks

Add code
Jun 10, 2020
Figure 1 for Gaussian Gated Linear Networks
Figure 2 for Gaussian Gated Linear Networks
Figure 3 for Gaussian Gated Linear Networks
Figure 4 for Gaussian Gated Linear Networks
Viaarxiv icon

Stochastic matrix games with bandit feedback

Add code
Jun 09, 2020
Figure 1 for Stochastic matrix games with bandit feedback
Figure 2 for Stochastic matrix games with bandit feedback
Figure 3 for Stochastic matrix games with bandit feedback
Figure 4 for Stochastic matrix games with bandit feedback
Viaarxiv icon

Model Selection in Contextual Stochastic Bandit Problems

Add code
Mar 03, 2020
Figure 1 for Model Selection in Contextual Stochastic Bandit Problems
Figure 2 for Model Selection in Contextual Stochastic Bandit Problems
Viaarxiv icon

Information Directed Sampling for Linear Partial Monitoring

Add code
Feb 25, 2020
Figure 1 for Information Directed Sampling for Linear Partial Monitoring
Figure 2 for Information Directed Sampling for Linear Partial Monitoring
Viaarxiv icon

Learning with Good Feature Representations in Bandits and in RL with a Generative Model

Add code
Nov 18, 2019
Viaarxiv icon

Adaptive Exploration in Linear Contextual Bandit

Add code
Oct 15, 2019
Figure 1 for Adaptive Exploration in Linear Contextual Bandit
Figure 2 for Adaptive Exploration in Linear Contextual Bandit
Figure 3 for Adaptive Exploration in Linear Contextual Bandit
Viaarxiv icon