Picture for Lihong Li

Lihong Li

Subgoal Discovery for Hierarchical Dialogue Policy Learning

Add code
Sep 22, 2018
Figure 1 for Subgoal Discovery for Hierarchical Dialogue Policy Learning
Figure 2 for Subgoal Discovery for Hierarchical Dialogue Policy Learning
Figure 3 for Subgoal Discovery for Hierarchical Dialogue Policy Learning
Figure 4 for Subgoal Discovery for Hierarchical Dialogue Policy Learning
Viaarxiv icon

Neural Approaches to Conversational AI

Add code
Sep 21, 2018
Figure 1 for Neural Approaches to Conversational AI
Viaarxiv icon

Data Poisoning Attacks in Contextual Bandits

Add code
Aug 24, 2018
Figure 1 for Data Poisoning Attacks in Contextual Bandits
Figure 2 for Data Poisoning Attacks in Contextual Bandits
Figure 3 for Data Poisoning Attacks in Contextual Bandits
Figure 4 for Data Poisoning Attacks in Contextual Bandits
Viaarxiv icon

SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation

Add code
Jun 05, 2018
Figure 1 for SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Figure 2 for SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Figure 3 for SBEED: Convergent Reinforcement Learning with Nonlinear Function Approximation
Viaarxiv icon

Scalable Bilinear $π$ Learning Using State and Action Features

Add code
Apr 27, 2018
Viaarxiv icon

Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear

Add code
Mar 13, 2018
Figure 1 for Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear
Figure 2 for Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear
Figure 3 for Combating Reinforcement Learning's Sisyphean Curse with Intrinsic Fear
Viaarxiv icon

End-to-End Task-Completion Neural Dialogue Systems

Add code
Feb 11, 2018
Figure 1 for End-to-End Task-Completion Neural Dialogue Systems
Figure 2 for End-to-End Task-Completion Neural Dialogue Systems
Figure 3 for End-to-End Task-Completion Neural Dialogue Systems
Figure 4 for End-to-End Task-Completion Neural Dialogue Systems
Viaarxiv icon

Boosting the Actor with Dual Critic

Add code
Dec 29, 2017
Figure 1 for Boosting the Actor with Dual Critic
Figure 2 for Boosting the Actor with Dual Critic
Figure 3 for Boosting the Actor with Dual Critic
Viaarxiv icon

BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems

Add code
Nov 23, 2017
Figure 1 for BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Figure 2 for BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Figure 3 for BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Figure 4 for BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Viaarxiv icon

A User Simulator for Task-Completion Dialogues

Add code
Nov 13, 2017
Figure 1 for A User Simulator for Task-Completion Dialogues
Figure 2 for A User Simulator for Task-Completion Dialogues
Figure 3 for A User Simulator for Task-Completion Dialogues
Figure 4 for A User Simulator for Task-Completion Dialogues
Viaarxiv icon