Alert button
Picture for Benjamin Van Roy

Benjamin Van Roy

Alert button

Provably Efficient Reinforcement Learning with Aggregated States

Add code
Bookmark button
Alert button
Dec 13, 2019
Shi Dong, Benjamin Van Roy, Zhengyuan Zhou

Viaarxiv icon

Information-Theoretic Confidence Bounds for Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 21, 2019
Xiuyuan Lu, Benjamin Van Roy

Figure 1 for Information-Theoretic Confidence Bounds for Reinforcement Learning
Viaarxiv icon

Comments on the Du-Kakade-Wang-Yang Lower Bounds

Add code
Bookmark button
Alert button
Nov 18, 2019
Benjamin Van Roy, Shi Dong

Figure 1 for Comments on the Du-Kakade-Wang-Yang Lower Bounds
Viaarxiv icon

Behaviour Suite for Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 13, 2019
Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepezvari, Satinder Singh, Benjamin Van Roy, Richard Sutton, David Silver, Hado Van Hasselt

Figure 1 for Behaviour Suite for Reinforcement Learning
Figure 2 for Behaviour Suite for Reinforcement Learning
Figure 3 for Behaviour Suite for Reinforcement Learning
Figure 4 for Behaviour Suite for Reinforcement Learning
Viaarxiv icon

On the Performance of Thompson Sampling on Logistic Bandits

Add code
Bookmark button
Alert button
May 12, 2019
Shi Dong, Tengyu Ma, Benjamin Van Roy

Figure 1 for On the Performance of Thompson Sampling on Logistic Bandits
Viaarxiv icon

An Information-Theoretic Analysis for Thompson Sampling with Many Actions

Add code
Bookmark button
Alert button
Oct 01, 2018
Shi Dong, Benjamin Van Roy

Figure 1 for An Information-Theoretic Analysis for Thompson Sampling with Many Actions
Viaarxiv icon

Deep Exploration via Randomized Value Functions

Add code
Bookmark button
Alert button
Jun 06, 2018
Ian Osband, Benjamin Van Roy, Daniel Russo, Zheng Wen

Figure 1 for Deep Exploration via Randomized Value Functions
Figure 2 for Deep Exploration via Randomized Value Functions
Figure 3 for Deep Exploration via Randomized Value Functions
Figure 4 for Deep Exploration via Randomized Value Functions
Viaarxiv icon

Scalable Coordinated Exploration in Concurrent Reinforcement Learning

Add code
Bookmark button
Alert button
May 23, 2018
Maria Dimakopoulou, Ian Osband, Benjamin Van Roy

Figure 1 for Scalable Coordinated Exploration in Concurrent Reinforcement Learning
Figure 2 for Scalable Coordinated Exploration in Concurrent Reinforcement Learning
Figure 3 for Scalable Coordinated Exploration in Concurrent Reinforcement Learning
Viaarxiv icon

Satisficing in Time-Sensitive Bandit Learning

Add code
Bookmark button
Alert button
Mar 07, 2018
Daniel Russo, Benjamin Van Roy

Figure 1 for Satisficing in Time-Sensitive Bandit Learning
Viaarxiv icon