Alert button
Picture for Seyed Mohammad Asghari

Seyed Mohammad Asghari

Alert button

Efficient Exploration for LLMs

Add code
Bookmark button
Alert button
Feb 01, 2024
Vikranth Dwaracherla, Seyed Mohammad Asghari, Botao Hao, Benjamin Van Roy

Viaarxiv icon

Approximate Thompson Sampling via Epistemic Neural Networks

Add code
Bookmark button
Alert button
Feb 18, 2023
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Morteza Ibrahimi, Xiuyuan Lu, Benjamin Van Roy

Figure 1 for Approximate Thompson Sampling via Epistemic Neural Networks
Figure 2 for Approximate Thompson Sampling via Epistemic Neural Networks
Figure 3 for Approximate Thompson Sampling via Epistemic Neural Networks
Figure 4 for Approximate Thompson Sampling via Epistemic Neural Networks
Viaarxiv icon

Fine-Tuning Language Models via Epistemic Neural Networks

Add code
Bookmark button
Alert button
Nov 03, 2022
Ian Osband, Seyed Mohammad Asghari, Benjamin Van Roy, Nat McAleese, John Aslanides, Geoffrey Irving

Figure 1 for Fine-Tuning Language Models via Epistemic Neural Networks
Figure 2 for Fine-Tuning Language Models via Epistemic Neural Networks
Figure 3 for Fine-Tuning Language Models via Epistemic Neural Networks
Figure 4 for Fine-Tuning Language Models via Epistemic Neural Networks
Viaarxiv icon

Robustness of Epinets against Distributional Shifts

Add code
Bookmark button
Alert button
Jul 01, 2022
Xiuyuan Lu, Ian Osband, Seyed Mohammad Asghari, Sven Gowal, Vikranth Dwaracherla, Zheng Wen, Benjamin Van Roy

Figure 1 for Robustness of Epinets against Distributional Shifts
Figure 2 for Robustness of Epinets against Distributional Shifts
Figure 3 for Robustness of Epinets against Distributional Shifts
Figure 4 for Robustness of Epinets against Distributional Shifts
Viaarxiv icon

Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping

Add code
Bookmark button
Alert button
Jun 08, 2022
Vikranth Dwaracherla, Zheng Wen, Ian Osband, Xiuyuan Lu, Seyed Mohammad Asghari, Benjamin Van Roy

Figure 1 for Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping
Figure 2 for Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping
Figure 3 for Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping
Figure 4 for Ensembles for Uncertainty Estimation: Benefits of Prior Functions and Bootstrapping
Viaarxiv icon

Evaluating High-Order Predictive Distributions in Deep Learning

Add code
Bookmark button
Alert button
Feb 28, 2022
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Xiuyuan Lu, Benjamin Van Roy

Figure 1 for Evaluating High-Order Predictive Distributions in Deep Learning
Figure 2 for Evaluating High-Order Predictive Distributions in Deep Learning
Figure 3 for Evaluating High-Order Predictive Distributions in Deep Learning
Figure 4 for Evaluating High-Order Predictive Distributions in Deep Learning
Viaarxiv icon

Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?

Add code
Bookmark button
Alert button
Oct 09, 2021
Ian Osband, Zheng Wen, Seyed Mohammad Asghari, Vikranth Dwaracherla, Botao Hao, Morteza Ibrahimi, Dieterich Lawson, Xiuyuan Lu, Brendan O'Donoghue, Benjamin Van Roy

Figure 1 for Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?
Figure 2 for Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?
Figure 3 for Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?
Figure 4 for Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?
Viaarxiv icon

Regret Bounds for Decentralized Learning in Cooperative Multi-Agent Dynamical Systems

Add code
Bookmark button
Alert button
Jan 27, 2020
Seyed Mohammad Asghari, Yi Ouyang, Ashutosh Nayyar

Figure 1 for Regret Bounds for Decentralized Learning in Cooperative Multi-Agent Dynamical Systems
Figure 2 for Regret Bounds for Decentralized Learning in Cooperative Multi-Agent Dynamical Systems
Figure 3 for Regret Bounds for Decentralized Learning in Cooperative Multi-Agent Dynamical Systems
Figure 4 for Regret Bounds for Decentralized Learning in Cooperative Multi-Agent Dynamical Systems
Viaarxiv icon

Learning to Code: Coded Caching via Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 09, 2019
Navid Naderializadeh, Seyed Mohammad Asghari

Figure 1 for Learning to Code: Coded Caching via Deep Reinforcement Learning
Figure 2 for Learning to Code: Coded Caching via Deep Reinforcement Learning
Figure 3 for Learning to Code: Coded Caching via Deep Reinforcement Learning
Figure 4 for Learning to Code: Coded Caching via Deep Reinforcement Learning
Viaarxiv icon