Sina Ghiassian

On the Importance of Uncertainty in Decision-Making with Large Language Models

Apr 03, 2024
Nicolò Felicioni, Lucas Maystre, Sina Ghiassian, Kamil Ciosek

In-context Exploration-Exploitation for Reinforcement Learning

Mar 11, 2024
Zhenwen Dai, Federico Tomasi, Sina Ghiassian

Auxiliary task discovery through generate-and-test

Oct 25, 2022
Banafsheh Rafiee, Sina Ghiassian, Jun Jin, Richard Sutton, Jun Luo, Adam White

Importance Sampling Placement in Off-Policy Temporal-Difference Methods

Mar 18, 2022
Eric Graves, Sina Ghiassian

An Empirical Comparison of Off-policy Prediction Learning Algorithms in the Four Rooms Environment

Sep 10, 2021
Sina Ghiassian, Richard S. Sutton

An Empirical Comparison of Off-policy Prediction Learning Algorithms on the Collision Task

Jun 11, 2021
Sina Ghiassian, Richard S. Sutton

A Generalized Projected Bellman Error for Off-policy Value Estimation in Reinforcement Learning

Apr 28, 2021
Andrew Patterson, Adam White, Sina Ghiassian, Martha White

Does Standard Backpropagation Forget Less Catastrophically Than Adam?

Feb 20, 2021
Dylan R. Ashley, Sina Ghiassian, Richard S. Sutton
