Alert button
Picture for Vikranth Dwaracherla

Vikranth Dwaracherla

Alert button

Efficient Exploration for LLMs

Add code
Bookmark button
Alert button
Feb 01, 2024
Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy

Viaarxiv icon

Approximate Thompson Sampling via Epistemic Neural Networks

Add code
Bookmark button
Alert button
Feb 18, 2023
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy

Figure 1 for Approximate Thompson Sampling via Epistemic Neural Networks
Figure 2 for Approximate Thompson Sampling via Epistemic Neural Networks
Figure 3 for Approximate Thompson Sampling via Epistemic Neural Networks
Figure 4 for Approximate Thompson Sampling via Epistemic Neural Networks
Viaarxiv icon

Robustness of Epinets against Distributional Shifts

Add code
Bookmark button
Alert button
Jul 01, 2022
Xiuyuan Lu, Ian Osband, Seyed Mohammad Asghari, Sven Gowal, Vikranth Dwaracherla, Zheng Wen, Benjamin Van Roy

Figure 1 for Robustness of Epinets against Distributional Shifts
Figure 2 for Robustness of Epinets against Distributional Shifts
Figure 3 for Robustness of Epinets against Distributional Shifts
Figure 4 for Robustness of Epinets against Distributional Shifts
Viaarxiv icon

Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping

Add code
Bookmark button
Alert button
Jun 08, 2022
Vikranth Dwaracherla, Zheng Wen, Ian Osband, Xiuyuan Lu, Seyed Mohammad Asghari, Benjamin Van Roy

Figure 1 for Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping
Figure 2 for Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping
Figure 3 for Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping
Figure 4 for Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping
Viaarxiv icon

Evaluating High-Order Predictive Distributions in Deep Learning

Add code
Bookmark button
Alert button
Feb 28, 2022
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Xiuyuan Lu, Benjamin Van Roy

Figure 1 for Evaluating High-Order Predictive Distributions in Deep Learning
Figure 2 for Evaluating High-Order Predictive Distributions in Deep Learning
Figure 3 for Evaluating High-Order Predictive Distributions in Deep Learning
Figure 4 for Evaluating High-Order Predictive Distributions in Deep Learning
Viaarxiv icon

Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?

Add code
Bookmark button
Alert button
Oct 09, 2021
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Botao Hao, Morteza Ibrahimi, Dieterich Lawson, Xiuyuan Lu, Brendan O'Donoghue, Benjamin Van Roy

Figure 1 for Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?
Figure 2 for Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?
Figure 3 for Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?
Figure 4 for Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?
Viaarxiv icon

Reinforcement Learning, Bit by Bit

Add code
Bookmark button
Alert button
Mar 14, 2021
Xiuyuan Lu, Benjamin Van Roy, Vikranth Dwaracherla, Morteza Ibrahimi, Ian Osband, Zheng Wen

Figure 1 for Reinforcement Learning, Bit by Bit
Figure 2 for Reinforcement Learning, Bit by Bit
Figure 3 for Reinforcement Learning, Bit by Bit
Figure 4 for Reinforcement Learning, Bit by Bit
Viaarxiv icon

Hypermodels for Exploration

Add code
Bookmark button
Alert button
Jun 12, 2020
Vikranth Dwaracherla, Xiuyuan Lu, Morteza Ibrahimi, Ian Osband, Zheng Wen, Benjamin Van Roy

Figure 1 for Hypermodels for Exploration
Figure 2 for Hypermodels for Exploration
Figure 3 for Hypermodels for Exploration
Figure 4 for Hypermodels for Exploration
Viaarxiv icon

Langevin DQN

Add code
Bookmark button
Alert button
Feb 17, 2020
Vikranth Dwaracherla, Benjamin Van Roy

Figure 1 for Langevin DQN
Figure 2 for Langevin DQN
Figure 3 for Langevin DQN
Figure 4 for Langevin DQN
Viaarxiv icon