Lihong Li

Stochastic Variance Reduction Methods for Policy Evaluation

Jun 09, 2017
Simon S. Du, Jianshu Chen, Lihong Li, Lin Xiao, Dengyong Zhou

Scaffolding Networks: Incremental Learning and Teaching Through Questioning

May 19, 2017
Asli Celikyilmaz, Li Deng, Lihong Li, Chong Wang

Towards End-to-End Reinforcement Learning of Dialogue Agents for Information Access

Apr 20, 2017
Bhuwan Dhingra, Lihong Li, Xiujun Li, Jianfeng Gao, Yun-Nung Chen, Faisal Ahmed, Li Deng

Investigation of Language Understanding Impact for Reinforcement Learning Based Dialogue Systems

Mar 21, 2017
Xiujun Li, Yun-Nung Chen, Lihong Li, Jianfeng Gao, Asli Celikyilmaz

Neuro-Symbolic Program Synthesis

Nov 06, 2016
Emilio Parisotto, Abdel-rahman Mohamed, Rishabh Singh, Lihong Li, Dengyong Zhou, Pushmeet Kohli

Deep Reinforcement Learning with a Combinatorial Action Space for Predicting Popular Reddit Threads

Sep 17, 2016
Ji He, Mari Ostendorf, Xiaodong He, Jianshu Chen, Jianfeng Gao, Lihong Li, Li Deng

On the Prior Sensitivity of Thompson Sampling

Jul 21, 2016
Che-Yu Liu, Lihong Li

An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives

Jul 09, 2016
Shipra Agrawal, Nikhil R. Devanur, Lihong Li

Deep Reinforcement Learning with a Natural Language Action Space

Jun 08, 2016
Ji He, Jianshu Chen, Xiaodong He, Jianfeng Gao, Lihong Li, Li Deng, Mari Ostendorf

Doubly Robust Off-policy Value Evaluation for Reinforcement Learning

May 26, 2016
Nan Jiang, Lihong Li
