Alert button
Picture for Martha White

Martha White

Alert button

Meta-Learning Representations for Continual Learning

Add code
Bookmark button
Alert button
May 29, 2019
Khurram Javed, Martha White

Figure 1 for Meta-Learning Representations for Continual Learning
Figure 2 for Meta-Learning Representations for Continual Learning
Figure 3 for Meta-Learning Representations for Continual Learning
Figure 4 for Meta-Learning Representations for Continual Learning
Viaarxiv icon

Planning with Expectation Models

Add code
Bookmark button
Alert button
Apr 03, 2019
Yi Wan, Muhammad Zaheer, Adam White, Martha White, Richard S. Sutton

Figure 1 for Planning with Expectation Models
Figure 2 for Planning with Expectation Models
Figure 3 for Planning with Expectation Models
Viaarxiv icon

Accelerating Large Scale Knowledge Distillation via Dynamic Importance Sampling

Add code
Bookmark button
Alert button
Dec 03, 2018
Minghan Li, Tanli Zuo, Ruicheng Li, Martha White, Weishi Zheng

Figure 1 for Accelerating Large Scale Knowledge Distillation via Dynamic Importance Sampling
Figure 2 for Accelerating Large Scale Knowledge Distillation via Dynamic Importance Sampling
Figure 3 for Accelerating Large Scale Knowledge Distillation via Dynamic Importance Sampling
Figure 4 for Accelerating Large Scale Knowledge Distillation via Dynamic Importance Sampling
Viaarxiv icon

An Off-policy Policy Gradient Theorem Using Emphatic Weightings

Add code
Bookmark button
Alert button
Nov 22, 2018
Ehsan Imani, Eric Graves, Martha White

Figure 1 for An Off-policy Policy Gradient Theorem Using Emphatic Weightings
Figure 2 for An Off-policy Policy Gradient Theorem Using Emphatic Weightings
Figure 3 for An Off-policy Policy Gradient Theorem Using Emphatic Weightings
Figure 4 for An Off-policy Policy Gradient Theorem Using Emphatic Weightings
Viaarxiv icon

The Barbados 2018 List of Open Issues in Continual Learning

Add code
Bookmark button
Alert button
Nov 16, 2018
Tom Schaul, Hado van Hasselt, Joseph Modayil, Martha White, Adam White, Pierre-Luc Bacon, Jean Harb, Shibl Mourad, Marc Bellemare, Doina Precup

Viaarxiv icon

Context-Dependent Upper-Confidence Bounds for Directed Exploration

Add code
Bookmark button
Alert button
Nov 15, 2018
Raksha Kumaraswamy, Matthew Schlegel, Adam White, Martha White

Figure 1 for Context-Dependent Upper-Confidence Bounds for Directed Exploration
Figure 2 for Context-Dependent Upper-Confidence Bounds for Directed Exploration
Viaarxiv icon

The Utility of Sparse Representations for Control in Reinforcement Learning

Add code
Bookmark button
Alert button
Nov 15, 2018
Vincent Liu, Raksha Kumaraswamy, Lei Le, Martha White

Figure 1 for The Utility of Sparse Representations for Control in Reinforcement Learning
Figure 2 for The Utility of Sparse Representations for Control in Reinforcement Learning
Figure 3 for The Utility of Sparse Representations for Control in Reinforcement Learning
Figure 4 for The Utility of Sparse Representations for Control in Reinforcement Learning
Viaarxiv icon

Online Off-policy Prediction

Add code
Bookmark button
Alert button
Nov 06, 2018
Sina Ghiassian, Andrew Patterson, Martha White, Richard S. Sutton, Adam White

Figure 1 for Online Off-policy Prediction
Figure 2 for Online Off-policy Prediction
Figure 3 for Online Off-policy Prediction
Figure 4 for Online Off-policy Prediction
Viaarxiv icon

Actor-Expert: A Framework for using Action-Value Methods in Continuous Action Spaces

Add code
Bookmark button
Alert button
Oct 22, 2018
Sungsu Lim, Ajin Joseph, Lei Le, Yangchen Pan, Martha White

Figure 1 for Actor-Expert: A Framework for using Action-Value Methods in Continuous Action Spaces
Figure 2 for Actor-Expert: A Framework for using Action-Value Methods in Continuous Action Spaces
Figure 3 for Actor-Expert: A Framework for using Action-Value Methods in Continuous Action Spaces
Figure 4 for Actor-Expert: A Framework for using Action-Value Methods in Continuous Action Spaces
Viaarxiv icon

High-confidence error estimates for learned value functions

Add code
Bookmark button
Alert button
Aug 28, 2018
Touqir Sajed, Wesley Chung, Martha White

Figure 1 for High-confidence error estimates for learned value functions
Figure 2 for High-confidence error estimates for learned value functions
Viaarxiv icon