Alert button

Imperfect also Deserves Reward: Multi-Level and Sequential Reward Modeling for Better Dialog Management

Add code
Bookmark button
Alert button
Apr 10, 2021
Zhengxu Hou, Bang Liu, Ruihui Zhao, Zijing Ou, Yafei Liu, Xi Chen, Yefeng Zheng

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: