Picture for Emma Brunskill

Emma Brunskill

Stanford University

Separating value functions across time-scales

Add code
Feb 08, 2019
Figure 1 for Separating value functions across time-scales
Figure 2 for Separating value functions across time-scales
Figure 3 for Separating value functions across time-scales
Figure 4 for Separating value functions across time-scales
Viaarxiv icon

Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds

Add code
Jan 01, 2019
Figure 1 for Tighter Problem-Dependent Regret Bounds in Reinforcement Learning without Domain Knowledge using Value Function Bounds
Viaarxiv icon

Distilling Information from a Flood: A Possibility for the Use of Meta-Analysis and Systematic Review in Machine Learning Research

Add code
Dec 03, 2018
Figure 1 for Distilling Information from a Flood: A Possibility for the Use of Meta-Analysis and Systematic Review in Machine Learning Research
Viaarxiv icon

Policy Certificates: Towards Accountable Reinforcement Learning

Add code
Nov 07, 2018
Figure 1 for Policy Certificates: Towards Accountable Reinforcement Learning
Figure 2 for Policy Certificates: Towards Accountable Reinforcement Learning
Figure 3 for Policy Certificates: Towards Accountable Reinforcement Learning
Figure 4 for Policy Certificates: Towards Accountable Reinforcement Learning
Viaarxiv icon

Representation Balancing MDPs for Off-Policy Policy Evaluation

Add code
Oct 31, 2018
Figure 1 for Representation Balancing MDPs for Off-Policy Policy Evaluation
Figure 2 for Representation Balancing MDPs for Off-Policy Policy Evaluation
Viaarxiv icon

When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms

Add code
Aug 04, 2018
Figure 1 for When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms
Figure 2 for When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms
Viaarxiv icon

Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters

Add code
Jul 10, 2018
Figure 1 for Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters
Figure 2 for Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters
Figure 3 for Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters
Figure 4 for Behaviour Policy Estimation in Off-Policy Policy Evaluation: Calibration Matters
Viaarxiv icon

Sample-Efficient Deep RL with Generative Adversarial Tree Search

Add code
Jun 15, 2018
Figure 1 for Sample-Efficient Deep RL with Generative Adversarial Tree Search
Figure 2 for Sample-Efficient Deep RL with Generative Adversarial Tree Search
Figure 3 for Sample-Efficient Deep RL with Generative Adversarial Tree Search
Figure 4 for Sample-Efficient Deep RL with Generative Adversarial Tree Search
Viaarxiv icon

Strategic Object Oriented Reinforcement Learning

Add code
Jun 01, 2018
Figure 1 for Strategic Object Oriented Reinforcement Learning
Figure 2 for Strategic Object Oriented Reinforcement Learning
Figure 3 for Strategic Object Oriented Reinforcement Learning
Figure 4 for Strategic Object Oriented Reinforcement Learning
Viaarxiv icon

Efficient Exploration through Bayesian Deep Q-Networks

Add code
Feb 13, 2018
Figure 1 for Efficient Exploration through Bayesian Deep Q-Networks
Figure 2 for Efficient Exploration through Bayesian Deep Q-Networks
Figure 3 for Efficient Exploration through Bayesian Deep Q-Networks
Figure 4 for Efficient Exploration through Bayesian Deep Q-Networks
Viaarxiv icon