Alert button
Picture for Kristopher De Asis

Kristopher De Asis

Alert button

Value-aware Importance Weighting for Off-policy Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 27, 2023
Kristopher De Asis, Eric Graves, Richard S. Sutton

Figure 1 for Value-aware Importance Weighting for Off-policy Reinforcement Learning
Figure 2 for Value-aware Importance Weighting for Off-policy Reinforcement Learning
Figure 3 for Value-aware Importance Weighting for Off-policy Reinforcement Learning
Figure 4 for Value-aware Importance Weighting for Off-policy Reinforcement Learning
Viaarxiv icon

Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 09, 2019
Kristopher De Asis, Alan Chan, Silviu Pitis, Richard S. Sutton, Daniel Graves

Figure 1 for Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Figure 2 for Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Figure 3 for Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Figure 4 for Fixed-Horizon Temporal Difference Methods for Stable Reinforcement Learning
Viaarxiv icon

Predicting Periodicity with Temporal Difference Learning

Add code
Bookmark button
Alert button
Sep 20, 2018
Kristopher De Asis, Brendan Bennett, Richard S. Sutton

Figure 1 for Predicting Periodicity with Temporal Difference Learning
Figure 2 for Predicting Periodicity with Temporal Difference Learning
Figure 3 for Predicting Periodicity with Temporal Difference Learning
Figure 4 for Predicting Periodicity with Temporal Difference Learning
Viaarxiv icon

Per-decision Multi-step Temporal Difference Learning with Control Variates

Add code
Bookmark button
Alert button
Jul 05, 2018
Kristopher De Asis, Richard S. Sutton

Figure 1 for Per-decision Multi-step Temporal Difference Learning with Control Variates
Figure 2 for Per-decision Multi-step Temporal Difference Learning with Control Variates
Figure 3 for Per-decision Multi-step Temporal Difference Learning with Control Variates
Figure 4 for Per-decision Multi-step Temporal Difference Learning with Control Variates
Viaarxiv icon

Multi-step Reinforcement Learning: A Unifying Algorithm

Add code
Bookmark button
Alert button
Jun 11, 2018
Kristopher De Asis, J. Fernando Hernandez-Garcia, G. Zacharias Holland, Richard S. Sutton

Figure 1 for Multi-step Reinforcement Learning: A Unifying Algorithm
Figure 2 for Multi-step Reinforcement Learning: A Unifying Algorithm
Figure 3 for Multi-step Reinforcement Learning: A Unifying Algorithm
Figure 4 for Multi-step Reinforcement Learning: A Unifying Algorithm
Viaarxiv icon