Picture for Matteo Pirotta

Matteo Pirotta

Smoothing Policies and Safe Policy Gradients

Add code
May 08, 2019
Figure 1 for Smoothing Policies and Safe Policy Gradients
Figure 2 for Smoothing Policies and Safe Policy Gradients
Viaarxiv icon

Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes

Add code
Dec 11, 2018
Viaarxiv icon

Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning

Add code
Jul 06, 2018
Figure 1 for Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning
Figure 2 for Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning
Figure 3 for Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning
Figure 4 for Efficient Bias-Span-Constrained Exploration-Exploitation in Reinforcement Learning
Viaarxiv icon

Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes

Add code
Jul 06, 2018
Figure 1 for Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes
Figure 2 for Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes
Figure 3 for Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes
Figure 4 for Near Optimal Exploration-Exploitation in Non-Communicating Markov Decision Processes
Viaarxiv icon

Stochastic Variance-Reduced Policy Gradient

Add code
Jun 14, 2018
Figure 1 for Stochastic Variance-Reduced Policy Gradient
Viaarxiv icon

Importance Weighted Transfer of Samples in Reinforcement Learning

Add code
May 28, 2018
Figure 1 for Importance Weighted Transfer of Samples in Reinforcement Learning
Figure 2 for Importance Weighted Transfer of Samples in Reinforcement Learning
Figure 3 for Importance Weighted Transfer of Samples in Reinforcement Learning
Figure 4 for Importance Weighted Transfer of Samples in Reinforcement Learning
Viaarxiv icon

Cost-Sensitive Approach to Batch Size Adaptation for Gradient Descent

Add code
Dec 09, 2017
Figure 1 for Cost-Sensitive Approach to Batch Size Adaptation for Gradient Descent
Figure 2 for Cost-Sensitive Approach to Batch Size Adaptation for Gradient Descent
Figure 3 for Cost-Sensitive Approach to Batch Size Adaptation for Gradient Descent
Figure 4 for Cost-Sensitive Approach to Batch Size Adaptation for Gradient Descent
Viaarxiv icon

Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation Supplementary Material

Add code
Nov 18, 2014
Figure 1 for Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation Supplementary Material
Figure 2 for Multi-objective Reinforcement Learning with Continuous Pareto Frontier Approximation Supplementary Material
Viaarxiv icon