Christoph Dann

Neural Active Learning with Performance Guarantees

Jun 06, 2021
Via arXiv

Regret Bound Balancing and Elimination for Model Selection in Bandits and RL

Dec 24, 2020
Via arXiv

Reinforcement Learning with Feedback Graphs

May 07, 2020
Via arXiv

Being Optimistic to Be Conservative: Quickly Learning a CVaR Policy

Nov 05, 2019
Via arXiv

Policy Certificates: Towards Accountable Reinforcement Learning

Nov 07, 2018
Via arXiv

On Oracle-Efficient PAC RL with Rich Observations

Oct 31, 2018
Via arXiv

Unifying PAC and Regret: Uniform PAC Bounds for Episodic Reinforcement Learning

Jan 02, 2018
Via arXiv

Decoupling Learning Rules from Representations

Jun 09, 2017
Via arXiv

Sample Efficient Policy Search for Optimal Stopping Domains

May 24, 2017
Via arXiv

Memory Lens: How Much Memory Does an Agent Use?

Nov 21, 2016
Via arXiv