Alert button
Picture for Benjamin Van Roy

Benjamin Van Roy

Alert button

Reinforcement Learning, Bit by Bit

Add code
Bookmark button
Alert button
Mar 14, 2021
Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen

Figure 1 for Reinforcement Learning, Bit by Bit
Figure 2 for Reinforcement Learning, Bit by Bit
Figure 3 for Reinforcement Learning, Bit by Bit
Figure 4 for Reinforcement Learning, Bit by Bit
Viaarxiv icon

Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent State

Add code
Bookmark button
Alert button
Mar 08, 2021
Shi Dong, Benjamin Van Roy, Zhengyuan Zhou

Figure 1 for Simple Agent, Complex Environment: Efficient Reinforcement Learning with Agent State
Viaarxiv icon

A Bit Better? Quantifying Information for Bandit Learning

Add code
Bookmark button
Alert button
Feb 18, 2021
Adithya M. Devraj, Benjamin Van Roy, Kuang Xu

Figure 1 for A Bit Better? Quantifying Information for Bandit Learning
Figure 2 for A Bit Better? Quantifying Information for Bandit Learning
Figure 3 for A Bit Better? Quantifying Information for Bandit Learning
Figure 4 for A Bit Better? Quantifying Information for Bandit Learning
Viaarxiv icon

Deciding What to Learn: A Rate-Distortion Approach

Add code
Bookmark button
Alert button
Jan 15, 2021
Dilip Arumugam, Benjamin Van Roy

Figure 1 for Deciding What to Learn: A Rate-Distortion Approach
Figure 2 for Deciding What to Learn: A Rate-Distortion Approach
Figure 3 for Deciding What to Learn: A Rate-Distortion Approach
Viaarxiv icon

Randomized Value Functions via Posterior State-Abstraction Sampling

Add code
Bookmark button
Alert button
Oct 05, 2020
Dilip Arumugam, Benjamin Van Roy

Figure 1 for Randomized Value Functions via Posterior State-Abstraction Sampling
Figure 2 for Randomized Value Functions via Posterior State-Abstraction Sampling
Viaarxiv icon

Hypermodels for Exploration

Add code
Bookmark button
Alert button
Jun 12, 2020
Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Ian Osband, Zheng Wen, Benjamin Van Roy

Figure 1 for Hypermodels for Exploration
Figure 2 for Hypermodels for Exploration
Figure 3 for Hypermodels for Exploration
Figure 4 for Hypermodels for Exploration
Viaarxiv icon

Langevin DQN

Add code
Bookmark button
Alert button
Feb 17, 2020
Vikranth Dwaracherla, Benjamin Van Roy

Figure 1 for Langevin DQN
Figure 2 for Langevin DQN
Figure 3 for Langevin DQN
Figure 4 for Langevin DQN
Viaarxiv icon