Alert button
Picture for Deyi Xiong

Deyi Xiong

Alert button

LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models

Add code
Bookmark button
Alert button
Mar 19, 2024
Chuang Liu, Renren Jin, Yuqi Ren, Deyi Xiong

Figure 1 for LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Figure 2 for LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Figure 3 for LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Figure 4 for LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Viaarxiv icon

OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety

Add code
Bookmark button
Alert button
Mar 18, 2024
Chuang Liu, Linhao Yu, Jiaxuan Li, Renren Jin, Yufei Huang, Ling Shi, Junhui Zhang, Xinmeng Ji, Tingting Cui, Tao Liu, Jinwang Song, Hongying Zan, Sun Li, Deyi Xiong

Figure 1 for OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety
Figure 2 for OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety
Figure 3 for OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety
Figure 4 for OpenEval: Benchmarking Chinese LLMs across Capability, Alignment and Safety
Viaarxiv icon

FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models

Add code
Bookmark button
Alert button
Mar 12, 2024
Yan Liu, Renren Jin, Lin Shi, Zheng Yao, Deyi Xiong

Figure 1 for FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models
Figure 2 for FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models
Figure 3 for FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models
Figure 4 for FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models
Viaarxiv icon

Exploring Multilingual Human Value Concepts in Large Language Models: Is Value Alignment Consistent, Transferable and Controllable across Languages?

Add code
Bookmark button
Alert button
Feb 28, 2024
Shaoyang Xu, Weilong Dong, Zishan Guo, Xinwei Wu, Deyi Xiong

Viaarxiv icon

Do Large Language Models Mirror Cognitive Language Processing?

Add code
Bookmark button
Alert button
Feb 28, 2024
Yuqi Ren, Renren Jin, Tongxuan Zhang, Deyi Xiong

Viaarxiv icon

A Comprehensive Evaluation of Quantization Strategies for Large Language Models

Add code
Bookmark button
Alert button
Feb 26, 2024
Renren Jin, Jiangcun Du, Wuwei Huang, Wei Liu, Jian Luan, Bin Wang, Deyi Xiong

Viaarxiv icon

RoleEval: A Bilingual Role Evaluation Benchmark for Large Language Models

Add code
Bookmark button
Alert button
Dec 26, 2023
Tianhao Shen, Sun Li, Deyi Xiong

Viaarxiv icon

CORECODE: A Common Sense Annotated Dialogue Dataset with Benchmark Tasks for Chinese Large Language Models

Add code
Bookmark button
Alert button
Dec 20, 2023
Dan Shi, Chaobin You, Jiantao Huang, Taihao Li, Deyi Xiong

Viaarxiv icon

AI-driven emergence of frequency information non-uniform distribution via THz metasurface spectrum prediction

Add code
Bookmark button
Alert button
Dec 05, 2023
Xiaohua Xing, Yuqi Ren, Die Zou, Qiankun Zhang, Bingxuan Mao, Jianquan Yao, Deyi Xiong, Shuang Zhang, Liang Wu

Viaarxiv icon

FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models

Add code
Bookmark button
Alert button
Nov 16, 2023
Yimin Jing, Renren Jin, Jiahao Hu, Huishi Qiu, Xiaohua Wang, Peng Wang, Deyi Xiong

Viaarxiv icon