Emma Brunskill

Stanford University

Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning

Jan 02, 2018
Via arXiv

Using Options and Covariance Testing for Long Horizon Off-Policy Policy Evaluation

Dec 05, 2017
Via arXiv

Generalized Grounding Graphs: A Probabilistic Framework for Understanding Grounded Commands

Nov 29, 2017
Via arXiv

On Ensuring that Intelligent Machines Are Well-Behaved

Aug 17, 2017
Via arXiv

Policy Gradient Methods for Reinforcement Learning with Function Approximation and Action-Dependent Baselines

Jun 20, 2017
Via arXiv

Decoupling Learning Rules from Representations

Jun 09, 2017
Via arXiv

Sample Efficient Policy Search for Optimal Stopping Domains

May 24, 2017
Via arXiv

Sample Efficient Feature Selection for Factored MDPs

Mar 09, 2017
Via arXiv

Importance Sampling with Unequal Support

Nov 10, 2016
Via arXiv

A PAC RL Algorithm for Episodic POMDPs

Jun 01, 2016
Via arXiv