Alert button
Picture for Benjamin Van Roy

Benjamin Van Roy

Alert button

Scalable Neural Contextual Bandit for Recommender Systems

Add code
Bookmark button
Alert button
Jun 26, 2023
Zheqing Zhu, Benjamin Van Roy

Figure 1 for Scalable Neural Contextual Bandit for Recommender Systems
Figure 2 for Scalable Neural Contextual Bandit for Recommender Systems
Figure 3 for Scalable Neural Contextual Bandit for Recommender Systems
Figure 4 for Scalable Neural Contextual Bandit for Recommender Systems
Viaarxiv icon

Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models

Add code
Bookmark button
Alert button
May 19, 2023
Wanqiao Xu, Shi Dong, Dilip Arumugam, Benjamin Van Roy

Figure 1 for Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models
Figure 2 for Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models
Figure 3 for Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models
Figure 4 for Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models
Viaarxiv icon

Bayesian Reinforcement Learning with Limited Cognitive Load

Add code
Bookmark button
Alert button
May 05, 2023
Dilip Arumugam, Mark K. Ho, Noah D. Goodman, Benjamin Van Roy

Figure 1 for Bayesian Reinforcement Learning with Limited Cognitive Load
Figure 2 for Bayesian Reinforcement Learning with Limited Cognitive Load
Figure 3 for Bayesian Reinforcement Learning with Limited Cognitive Load
Figure 4 for Bayesian Reinforcement Learning with Limited Cognitive Load
Viaarxiv icon

A Definition of Non-Stationary Bandits

Add code
Bookmark button
Alert button
Feb 23, 2023
Yueyang Liu, Benjamin Van Roy, Kuang Xu

Viaarxiv icon

Approximate Thompson Sampling via Epistemic Neural Networks

Add code
Bookmark button
Alert button
Feb 18, 2023
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy

Figure 1 for Approximate Thompson Sampling via Epistemic Neural Networks
Figure 2 for Approximate Thompson Sampling via Epistemic Neural Networks
Figure 3 for Approximate Thompson Sampling via Epistemic Neural Networks
Figure 4 for Approximate Thompson Sampling via Epistemic Neural Networks
Viaarxiv icon

Leveraging Demonstrations to Improve Online Learning: Quality Matters

Add code
Bookmark button
Alert button
Feb 08, 2023
Botao Hao, Rahul Jain, Tor Lattimore, Benjamin Van Roy, Zheng Wen

Figure 1 for Leveraging Demonstrations to Improve Online Learning: Quality Matters
Figure 2 for Leveraging Demonstrations to Improve Online Learning: Quality Matters
Figure 3 for Leveraging Demonstrations to Improve Online Learning: Quality Matters
Figure 4 for Leveraging Demonstrations to Improve Online Learning: Quality Matters
Viaarxiv icon

Inclusive Artificial Intelligence

Add code
Bookmark button
Alert button
Dec 24, 2022
Dilip Arumugam, Shi Dong, Benjamin Van Roy

Figure 1 for Inclusive Artificial Intelligence
Viaarxiv icon

An Information-Theoretic Analysis of Compute-Optimal Neural Scaling Laws

Add code
Bookmark button
Alert button
Dec 02, 2022
Hong Jun Jeon, Benjamin Van Roy

Figure 1 for An Information-Theoretic Analysis of Compute-Optimal Neural Scaling Laws
Figure 2 for An Information-Theoretic Analysis of Compute-Optimal Neural Scaling Laws
Figure 3 for An Information-Theoretic Analysis of Compute-Optimal Neural Scaling Laws
Figure 4 for An Information-Theoretic Analysis of Compute-Optimal Neural Scaling Laws
Viaarxiv icon

Posterior Sampling for Continuing Environments

Add code
Bookmark button
Alert button
Nov 29, 2022
Wanqiao Xu, Shi Dong, Benjamin Van Roy

Viaarxiv icon