Picture for Alessandro Lazaric

Alessandro Lazaric

INRIA Lille - Nord Europe

Sequential Transfer in Multi-armed Bandit with Finite Set of Models

Add code
Jul 25, 2013
Figure 1 for Sequential Transfer in Multi-armed Bandit with Finite Set of Models
Figure 2 for Sequential Transfer in Multi-armed Bandit with Finite Set of Models
Figure 3 for Sequential Transfer in Multi-armed Bandit with Finite Set of Models
Figure 4 for Sequential Transfer in Multi-armed Bandit with Finite Set of Models
Viaarxiv icon

Regret Bounds for Reinforcement Learning with Policy Advice

Add code
Jul 17, 2013
Figure 1 for Regret Bounds for Reinforcement Learning with Policy Advice
Viaarxiv icon

Risk-Aversion in Multi-armed Bandits

Add code
Jan 09, 2013
Figure 1 for Risk-Aversion in Multi-armed Bandits
Figure 2 for Risk-Aversion in Multi-armed Bandits
Figure 3 for Risk-Aversion in Multi-armed Bandits
Figure 4 for Risk-Aversion in Multi-armed Bandits
Viaarxiv icon

A Dantzig Selector Approach to Temporal Difference Learning

Add code
Jun 27, 2012
Figure 1 for A Dantzig Selector Approach to Temporal Difference Learning
Figure 2 for A Dantzig Selector Approach to Temporal Difference Learning
Figure 3 for A Dantzig Selector Approach to Temporal Difference Learning
Figure 4 for A Dantzig Selector Approach to Temporal Difference Learning
Viaarxiv icon

Transfer from Multiple MDPs

Add code
Sep 01, 2011
Figure 1 for Transfer from Multiple MDPs
Figure 2 for Transfer from Multiple MDPs
Figure 3 for Transfer from Multiple MDPs
Figure 4 for Transfer from Multiple MDPs
Viaarxiv icon