Lihong Li

Generalized Thompson Sampling for Contextual Bandits

Oct 27, 2013
Via arXiv

Sample Complexity of Multi-task Reinforcement Learning

Sep 26, 2013
Via arXiv

Sample-efficient Nonstationary Policy Evaluation for Contextual Bandits

Oct 16, 2012
Via arXiv

Incremental Model-based Learners With Formal Learning-Time Guarantees

Jun 27, 2012
Via arXiv

CORL: A Continuous-state Offset-dynamics Reinforcement Learner

Jun 13, 2012
Via arXiv

A Bayesian Sampling Approach to Exploration in Reinforcement Learning

May 09, 2012
Via arXiv

A Contextual-Bandit Approach to Personalized News Article Recommendation

Mar 01, 2012
Via arXiv

Unbiased Offline Evaluation of Contextual-bandit-based News Article Recommendation Algorithms

Mar 01, 2012
Via arXiv

Contextual Bandit Algorithms with Supervised Learning Guarantees

Oct 27, 2011
Via arXiv

Doubly Robust Policy Evaluation and Learning

May 06, 2011
Via arXiv