Alert button
Picture for Zhuoer Feng

Zhuoer Feng

Alert button

AlignBench: Benchmarking Chinese Alignment of Large Language Models

Add code
Bookmark button
Alert button
Dec 05, 2023
Xiao Liu, Xuanyu Lei, Shengyuan Wang, Yue Huang, Zhuoer Feng, Bosi Wen, Jiale Cheng, Pei Ke, Yifan Xu, Weng Lam Tam, Xiaohan Zhang, Lichao Sun, Hongning Wang, Jing Zhang, Minlie Huang, Yuxiao Dong, Jie Tang

Viaarxiv icon

CritiqueLLM: Scaling LLM-as-Critic for Effective and Explainable Evaluation of Large Language Model Generation

Add code
Bookmark button
Alert button
Nov 30, 2023
Pei Ke, Bosi Wen, Zhuoer Feng, Xiao Liu, Xuanyu Lei, Jiale Cheng, Shengyuan Wang, Aohan Zeng, Yuxiao Dong, Hongning Wang, Jie Tang, Minlie Huang

Viaarxiv icon

LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation

Add code
Bookmark button
Alert button
Aug 30, 2021
Jian Guan, Zhuoer Feng, Yamei Chen, Ruilin He, Xiaoxi Mao, Changjie Fan, Minlie Huang

Figure 1 for LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation
Figure 2 for LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation
Figure 3 for LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation
Figure 4 for LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation
Viaarxiv icon

OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics

Add code
Bookmark button
Alert button
May 19, 2021
Jian Guan, Zhexin Zhang, Zhuoer Feng, Zitao Liu, Wenbiao Ding, Xiaoxi Mao, Changjie Fan, Minlie Huang

Figure 1 for OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
Figure 2 for OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
Figure 3 for OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
Figure 4 for OpenMEVA: A Benchmark for Evaluating Open-ended Story Generation Metrics
Viaarxiv icon