Alert button
Picture for Lihong Li

Lihong Li

Alert button

Recurrent Reinforcement Learning: A Hybrid Approach

Add code
Bookmark button
Alert button
Nov 19, 2015
Xiujun Li, Lihong Li, Jianfeng Gao, Xiaodong He, Jianshu Chen, Li Deng, Ji He

Figure 1 for Recurrent Reinforcement Learning: A Hybrid Approach
Figure 2 for Recurrent Reinforcement Learning: A Hybrid Approach
Figure 3 for Recurrent Reinforcement Learning: A Hybrid Approach
Figure 4 for Recurrent Reinforcement Learning: A Hybrid Approach
Viaarxiv icon

The Online Coupon-Collector Problem and Its Application to Lifelong Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 21, 2015
Emma Brunskill, Lihong Li

Figure 1 for The Online Coupon-Collector Problem and Its Application to Lifelong Reinforcement Learning
Figure 2 for The Online Coupon-Collector Problem and Its Application to Lifelong Reinforcement Learning
Viaarxiv icon

Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems

Add code
Bookmark button
Alert button
Apr 28, 2015
Dragomir Yankov, Pavel Berkhin, Lihong Li

Figure 1 for Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems
Figure 2 for Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems
Figure 3 for Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems
Figure 4 for Evaluation of Explore-Exploit Policies in Multi-result Ranking Systems
Viaarxiv icon

Doubly Robust Policy Evaluation and Optimization

Add code
Bookmark button
Alert button
Mar 10, 2015
Miroslav Dudík, Dumitru Erhan, John Langford, Lihong Li

Figure 1 for Doubly Robust Policy Evaluation and Optimization
Figure 2 for Doubly Robust Policy Evaluation and Optimization
Figure 3 for Doubly Robust Policy Evaluation and Optimization
Figure 4 for Doubly Robust Policy Evaluation and Optimization
Viaarxiv icon

Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits

Add code
Bookmark button
Alert button
Oct 14, 2014
Alekh Agarwal, Daniel Hsu, Satyen Kale, John Langford, Lihong Li, Robert E. Schapire

Figure 1 for Taming the Monster: A Fast and Simple Algorithm for Contextual Bandits
Viaarxiv icon

On Minimax Optimal Offline Policy Evaluation

Add code
Bookmark button
Alert button
Sep 12, 2014
Lihong Li, Remi Munos, Csaba Szepesvari

Figure 1 for On Minimax Optimal Offline Policy Evaluation
Viaarxiv icon

Counterfactual Estimation and Optimization of Click Metrics for Search Engines

Add code
Bookmark button
Alert button
Mar 12, 2014
Lihong Li, Shunbao Chen, Jim Kleban, Ankur Gupta

Figure 1 for Counterfactual Estimation and Optimization of Click Metrics for Search Engines
Figure 2 for Counterfactual Estimation and Optimization of Click Metrics for Search Engines
Figure 3 for Counterfactual Estimation and Optimization of Click Metrics for Search Engines
Figure 4 for Counterfactual Estimation and Optimization of Click Metrics for Search Engines
Viaarxiv icon

Efficient Online Bootstrapping for Large Scale Learning

Add code
Bookmark button
Alert button
Dec 18, 2013
Zhen Qin, Vaclav Petricek, Nikos Karampatziakis, Lihong Li, John Langford

Figure 1 for Efficient Online Bootstrapping for Large Scale Learning
Figure 2 for Efficient Online Bootstrapping for Large Scale Learning
Figure 3 for Efficient Online Bootstrapping for Large Scale Learning
Figure 4 for Efficient Online Bootstrapping for Large Scale Learning
Viaarxiv icon

Generalized Thompson Sampling for Contextual Bandits

Add code
Bookmark button
Alert button
Oct 27, 2013
Lihong Li

Viaarxiv icon

Sample Complexity of Multi-task Reinforcement Learning

Add code
Bookmark button
Alert button
Sep 26, 2013
Emma Brunskill, Lihong Li

Figure 1 for Sample Complexity of Multi-task Reinforcement Learning
Figure 2 for Sample Complexity of Multi-task Reinforcement Learning
Viaarxiv icon