Picture for Tor Lattimore

Tor Lattimore

Online Newton Method for Bandit Convex Optimisation

Add code
Jun 10, 2024
Viaarxiv icon

Bandit Convex Optimisation

Add code
Feb 09, 2024
Figure 1 for Bandit Convex Optimisation
Figure 2 for Bandit Convex Optimisation
Figure 3 for Bandit Convex Optimisation
Figure 4 for Bandit Convex Optimisation
Viaarxiv icon

Probabilistic Inference in Reinforcement Learning Done Right

Add code
Nov 22, 2023
Viaarxiv icon

Context-lumpable stochastic bandits

Add code
Jun 22, 2023
Viaarxiv icon

Sequential Best-Arm Identification with Application to Brain-Computer Interface

Add code
May 17, 2023
Viaarxiv icon

A Second-Order Method for Stochastic Bandit Convex Optimisation

Add code
Feb 10, 2023
Figure 1 for A Second-Order Method for Stochastic Bandit Convex Optimisation
Figure 2 for A Second-Order Method for Stochastic Bandit Convex Optimisation
Viaarxiv icon

Leveraging Demonstrations to Improve Online Learning: Quality Matters

Add code
Feb 08, 2023
Figure 1 for Leveraging Demonstrations to Improve Online Learning: Quality Matters
Figure 2 for Leveraging Demonstrations to Improve Online Learning: Quality Matters
Figure 3 for Leveraging Demonstrations to Improve Online Learning: Quality Matters
Figure 4 for Leveraging Demonstrations to Improve Online Learning: Quality Matters
Viaarxiv icon

Linear Partial Monitoring for Sequential Decision-Making: Algorithms, Regret Bounds and Applications

Add code
Feb 07, 2023
Viaarxiv icon

Regret Bounds for Information-Directed Reinforcement Learning

Add code
Jun 09, 2022
Viaarxiv icon

Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost

Add code
May 26, 2022
Figure 1 for Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost
Figure 2 for Distributed Contextual Linear Bandits with Minimax Optimal Communication Cost
Viaarxiv icon