Tor Lattimore

Adaptivity, Variance and Separation for Adversarial Bandits

Mar 19, 2019
Via arXiv

An Information-Theoretic Approach to Minimax Regret in Partial Monitoring

Feb 01, 2019
Via arXiv

A Geometric Perspective on Optimal Representations for Reinforcement Learning

Jan 31, 2019
Via arXiv

Soft-Bayes: Prod for Mixtures of Experts with Log-Loss

Jan 08, 2019
Via arXiv

Single-Agent Policy Tree Search With Guarantees

Nov 28, 2018
Via arXiv

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits

Nov 13, 2018
Via arXiv

BubbleRank: Safe Online Learning to Rerank

Jun 15, 2018
Via arXiv

TopRank: A practical algorithm for online stochastic ranking

Jun 06, 2018
Via arXiv

Cleaning up the neighborhood: A full classification for adversarial partial monitoring

May 23, 2018
Via arXiv

Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning

Jan 02, 2018
Via arXiv