MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues

Feb 22, 2024
Ge Bai, Jie Liu, Xingyuan Bu, Yancheng He, Jiaheng Liu, Zhanhui Zhou, Zhuoran Lin, Wenbo Su, Tiezheng Ge, Bo Zheng, Wanli Ouyang
