Get our free extension to see links to code for papers anywhere online!

Chrome logo  Add to Chrome

Firefox logo Add to Firefox

Picture for Benjamin Van Roy

Benjamin Van Roy

Stanford University Department of Electrical Engineering

Fine-Tuning Language Models via Epistemic Neural Networks


Nov 03, 2022
Ian Osband, Seyed Mohammad Asghari, Benjamin Van Roy, Nat McAleese, John Aslanides, Geoffrey Irving


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

On Rate-Distortion Theory in Capacity-Limited Cognition & Reinforcement Learning


Oct 30, 2022
Dilip Arumugam, Mark K. Ho, Noah D. Goodman, Benjamin Van Roy

* Accepted to the NeurIPS Workshop on Information-Theoretic Principles in Cognitive Systems (InfoCog) 2022. arXiv admin note: text overlap with arXiv:2206.02072 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Is Stochastic Gradient Descent Near Optimal?


Oct 06, 2022
Yifan Zhu, Hong Jun Jeon, Benjamin Van Roy


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Robustness of Epinets against Distributional Shifts


Jul 01, 2022
Xiuyuan Lu, Ian Osband, Seyed Mohammad Asghari, Sven Gowal, Vikranth Dwaracherla, Zheng Wen, Benjamin Van Roy


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping


Jun 08, 2022
Vikranth Dwaracherla, Zheng Wen, Ian Osband, Xiuyuan Lu, Seyed Mohammad Asghari, Benjamin Van Roy


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning


Jun 04, 2022
Dilip Arumugam, Benjamin Van Roy


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning


Jun 04, 2022
Dilip Arumugam, Benjamin Van Roy

* Accepted to the Multi-Disciplinary Conference on Reinforcement Learning and Decision Making (RLDM) 2022 

   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Nonstationary Bandit Learning via Predictive Sampling


May 04, 2022
Yueyang Liu, Benjamin Van Roy, Kuang Xu


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email

Sample Complexity versus Depth: An Information Theoretic Analysis


Apr 07, 2022
Hong Jun Jeon, Benjamin Van Roy


   Access Paper or Ask Questions

  • Share via Twitter
  • Share via Facebook
  • Share via LinkedIn
  • Share via Whatsapp
  • Share via Messenger
  • Share via Email
1
2
3
4
5
6
7
>>