Alert button
Picture for Richard Sutton

Richard Sutton

Alert button

MetaOptimize: A Framework for Optimizing Step Sizes and Other Meta-parameters

Add code
Bookmark button
Alert button
Feb 12, 2024
Arsalan Sharifnassab, Saber Salehkaleybar, Richard Sutton

Viaarxiv icon

Step-size Optimization for Continual Learning

Add code
Bookmark button
Alert button
Jan 30, 2024
Thomas Degris, Khurram Javed, Arsalan Sharifnassab, Yuxin Liu, Richard Sutton

Viaarxiv icon

Toward Efficient Gradient-Based Value Estimation

Add code
Bookmark button
Alert button
Jan 31, 2023
Arsalan Sharifnassab, Richard Sutton

Figure 1 for Toward Efficient Gradient-Based Value Estimation
Figure 2 for Toward Efficient Gradient-Based Value Estimation
Figure 3 for Toward Efficient Gradient-Based Value Estimation
Figure 4 for Toward Efficient Gradient-Based Value Estimation
Viaarxiv icon

Auxiliary task discovery through generate-and-test

Add code
Bookmark button
Alert button
Oct 25, 2022
Banafsheh Rafiee, Sina Ghiassian, Jun Jin, Richard Sutton, Jun Luo, Adam White

Figure 1 for Auxiliary task discovery through generate-and-test
Figure 2 for Auxiliary task discovery through generate-and-test
Figure 3 for Auxiliary task discovery through generate-and-test
Figure 4 for Auxiliary task discovery through generate-and-test
Viaarxiv icon

Prediction problems inspired by animal learning

Add code
Bookmark button
Alert button
Nov 09, 2020
Banafsheh Rafiee, Sina Ghiassian, Raksha Kumaraswamy, Richard Sutton, Elliot Ludvig, Adam White

Figure 1 for Prediction problems inspired by animal learning
Figure 2 for Prediction problems inspired by animal learning
Figure 3 for Prediction problems inspired by animal learning
Figure 4 for Prediction problems inspired by animal learning
Viaarxiv icon

Behaviour Suite for Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 13, 2019
Ian Osband, Yotam Doron, Matteo Hessel, John Aslanides, Eren Sezener, Andre Saraiva, Katrina McKinney, Tor Lattimore, Csaba Szepezvari, Satinder Singh, Benjamin Van Roy, Richard Sutton, David Silver, Hado Van Hasselt

Figure 1 for Behaviour Suite for Reinforcement Learning
Figure 2 for Behaviour Suite for Reinforcement Learning
Figure 3 for Behaviour Suite for Reinforcement Learning
Figure 4 for Behaviour Suite for Reinforcement Learning
Viaarxiv icon