Towards Standard Criteria for Human Evaluation of Chatbots: A Survey

May 24, 2021
Hongru Liang, Huaqing Li



Human evaluation is becoming a necessity for testing the performance of Chatbots. However, off-the-shelf settings suffer from severe reliability and replication issues, partly because of the extremely high diversity of criteria. It is high time to come up with standard criteria and exact definitions. To this end, we conduct a thorough investigation of 105 papers involving human evaluation of Chatbots. Based on this investigation, we propose five standard criteria along with precise definitions.





