Picture for Lihong Li

Lihong Li

Deep Reinforcement Learning with a Natural Language Action Space

Add code
Jun 08, 2016
Figure 1 for Deep Reinforcement Learning with a Natural Language Action Space
Figure 2 for Deep Reinforcement Learning with a Natural Language Action Space
Figure 3 for Deep Reinforcement Learning with a Natural Language Action Space
Figure 4 for Deep Reinforcement Learning with a Natural Language Action Space
Viaarxiv icon

Doubly Robust Off-policy Value Evaluation for Reinforcement Learning

Add code
May 26, 2016
Figure 1 for Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Figure 2 for Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Figure 3 for Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Figure 4 for Doubly Robust Off-policy Value Evaluation for Reinforcement Learning
Viaarxiv icon

Recurrent Reinforcement Learning: A Hybrid Approach

Add code
Nov 19, 2015
Figure 1 for Recurrent Reinforcement Learning: A Hybrid Approach
Figure 2 for Recurrent Reinforcement Learning: A Hybrid Approach
Figure 3 for Recurrent Reinforcement Learning: A Hybrid Approach
Figure 4 for Recurrent Reinforcement Learning: A Hybrid Approach
Viaarxiv icon

The Online Coupon-Collector Problem and Its Application to Lifelong Reinforcement Learning

Add code
Sep 21, 2015
Figure 1 for The Online Coupon-Collector Problem and Its Application to Lifelong Reinforcement Learning
Figure 2 for The Online Coupon-Collector Problem and Its Application to Lifelong Reinforcement Learning
Viaarxiv icon

Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems

Add code
Apr 28, 2015
Figure 1 for Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems
Figure 2 for Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems
Figure 3 for Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems
Figure 4 for Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems
Viaarxiv icon

Doubly Robust Policy Evaluation and Optimization

Add code
Mar 10, 2015
Figure 1 for Doubly Robust Policy Evaluation and Optimization
Figure 2 for Doubly Robust Policy Evaluation and Optimization
Figure 3 for Doubly Robust Policy Evaluation and Optimization
Figure 4 for Doubly Robust Policy Evaluation and Optimization
Viaarxiv icon

Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits

Add code
Oct 14, 2014
Figure 1 for Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits
Viaarxiv icon

On Minimax Optimal Offline Policy Evaluation

Add code
Sep 12, 2014
Figure 1 for On Minimax Optimal Offline Policy Evaluation
Viaarxiv icon

Counterfactual Estimation and Optimization of Click Metrics for Search Engines

Add code
Mar 12, 2014
Figure 1 for Counterfactual Estimation and Optimization of Click Metrics for Search Engines
Figure 2 for Counterfactual Estimation and Optimization of Click Metrics for Search Engines
Figure 3 for Counterfactual Estimation and Optimization of Click Metrics for Search Engines
Figure 4 for Counterfactual Estimation and Optimization of Click Metrics for Search Engines
Viaarxiv icon

Efficient Online Bootstrapping for Large Scale Learning

Add code
Dec 18, 2013
Figure 1 for Efficient Online Bootstrapping for Large Scale Learning
Figure 2 for Efficient Online Bootstrapping for Large Scale Learning
Figure 3 for Efficient Online Bootstrapping for Large Scale Learning
Figure 4 for Efficient Online Bootstrapping for Large Scale Learning
Viaarxiv icon