Alert button
Picture for Benjamin Van Roy

Benjamin Van Roy

Alert button

Adaptive Crowdsourcing Via Self-Supervised Learning

Add code
Bookmark button
Alert button
Feb 02, 2024
Anmol Kagrecha, Henrik Marklund, Benjamin Van Roy, Hong Jun Jeon, Richard Zeckhauser

Viaarxiv icon

Efficient Exploration for LLMs

Add code
Bookmark button
Alert button
Feb 01, 2024
Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy

Viaarxiv icon

An Information-Theoretic Analysis of In-Context Learning

Add code
Bookmark button
Alert button
Jan 28, 2024
Hong Jun Jeon, Jason D. Lee, Qi Lei, Benjamin Van Roy

Viaarxiv icon

RLHF and IIA: Perverse Incentives

Add code
Bookmark button
Alert button
Dec 02, 2023
Wanqiao Xu, Shi Dong, Xiuyuan Lu, Grace Lam, Zheng Wen, Benjamin Van Roy

Viaarxiv icon

Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling

Add code
Bookmark button
Alert button
Oct 14, 2023
Zheqing Zhu, Yueyang Liu, Xu Kuang, Benjamin Van Roy

Figure 1 for Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Figure 2 for Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Figure 3 for Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Figure 4 for Non-Stationary Contextual Bandit Learning via Neural Predictive Ensemble Sampling
Viaarxiv icon

Maintaining Plasticity via Regenerative Regularization

Add code
Bookmark button
Alert button
Aug 23, 2023
Saurabh Kumar, Henrik Marklund, Benjamin Van Roy

Viaarxiv icon

A Definition of Continual Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 20, 2023
David Abel, André Barreto, Benjamin Van Roy, Doina Precup, Hado van Hasselt, Satinder Singh

Figure 1 for A Definition of Continual Reinforcement Learning
Figure 2 for A Definition of Continual Reinforcement Learning
Figure 3 for A Definition of Continual Reinforcement Learning
Viaarxiv icon

On the Convergence of Bounded Agents

Add code
Bookmark button
Alert button
Jul 20, 2023
David Abel, André Barreto, Hado van Hasselt, Benjamin Van Roy, Doina Precup, Satinder Singh

Figure 1 for On the Convergence of Bounded Agents
Viaarxiv icon

Continual Learning as Computationally Constrained Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 10, 2023
Saurabh Kumar, Henrik Marklund, Ashish Rao, Yifan Zhu, Hong Jun Jeon, Yueyang Liu, Benjamin Van Roy

Viaarxiv icon