Alert button
Picture for Lihong Li

Lihong Li

Alert button

SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation

Add code
Bookmark button
Alert button
Jun 05, 2018
Bo Dai, Albert Shaw, Lihong Li, Lin Xiao, Niao He, Zhen Liu, Jianshu Chen, Le Song

Figure 1 for SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Figure 2 for SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Figure 3 for SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Viaarxiv icon

Scalable Bilinear $π$ Learning Using State and Action Features

Add code
Bookmark button
Alert button
Apr 27, 2018
Yichen Chen, Lihong Li, Mengdi Wang

Viaarxiv icon

Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear

Add code
Bookmark button
Alert button
Mar 13, 2018
Zachary C. Lipton, Kamyar Azizzadenesheli, Abhishek Kumar, Lihong Li, Jianfeng Gao, Li Deng

Figure 1 for Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear
Figure 2 for Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear
Figure 3 for Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear
Viaarxiv icon

End-to-End Task-Completion Neural Dialogue Systems

Add code
Bookmark button
Alert button
Feb 11, 2018
Xiujun Li, Yun-Nung Chen, Lihong Li, Jianfeng Gao, Asli Celikyilmaz

Figure 1 for End-to-End Task-Completion Neural Dialogue Systems
Figure 2 for End-to-End Task-Completion Neural Dialogue Systems
Figure 3 for End-to-End Task-Completion Neural Dialogue Systems
Figure 4 for End-to-End Task-Completion Neural Dialogue Systems
Viaarxiv icon

Boosting the Actor with Dual Critic

Add code
Bookmark button
Alert button
Dec 29, 2017
Bo Dai, Albert Shaw, Niao He, Lihong Li, Le Song

Figure 1 for Boosting the Actor with Dual Critic
Figure 2 for Boosting the Actor with Dual Critic
Figure 3 for Boosting the Actor with Dual Critic
Viaarxiv icon

BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems

Add code
Bookmark button
Alert button
Nov 23, 2017
Zachary C. Lipton, Xiujun Li, Jianfeng Gao, Lihong Li, Faisal Ahmed, Li Deng

Figure 1 for BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Figure 2 for BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Figure 3 for BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Figure 4 for BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Viaarxiv icon

A User Simulator for Task-Completion Dialogues

Add code
Bookmark button
Alert button
Nov 13, 2017
Xiujun Li, Zachary C. Lipton, Bhuwan Dhingra, Lihong Li, Jianfeng Gao, Yun-Nung Chen

Figure 1 for A User Simulator for Task-Completion Dialogues
Figure 2 for A User Simulator for Task-Completion Dialogues
Figure 3 for A User Simulator for Task-Completion Dialogues
Figure 4 for A User Simulator for Task-Completion Dialogues
Viaarxiv icon

Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Jul 22, 2017
Baolin Peng, Xiujun Li, Lihong Li, Jianfeng Gao, Asli Celikyilmaz, Sungjin Lee, Kam-Fai Wong

Figure 1 for Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
Figure 2 for Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
Figure 3 for Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
Figure 4 for Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning
Viaarxiv icon

Provably Optimal Algorithms for Generalized Linear Contextual Bandits

Add code
Bookmark button
Alert button
Jun 18, 2017
Lihong Li, Yu Lu, Dengyong Zhou

Viaarxiv icon