Alert button
Picture for Zhengxiao Du

Zhengxiao Du

Alert button

ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline

Add code
Bookmark button
Alert button
Apr 03, 2024
Yifan Xu, Xiao Liu, Xinghan Liu, Zhenyu Hou, Yueyan Li, Xiaohan Zhang, Zihan Wang, Aohan Zeng, Zhengxiao Du, Wenyi Zhao, Jie Tang, Yuxiao Dong

Viaarxiv icon

ChatGLM-RLHF: Practices of Aligning Large Language Models with Human Feedback

Add code
Bookmark button
Alert button
Apr 03, 2024
Zhenyu Hou, Yilin Niu, Zhengxiao Du, Xiaohan Zhang, Xiao Liu, Aohan Zeng, Qinkai Zheng, Minlie Huang, Hongning Wang, Jie Tang, Yuxiao Dong

Viaarxiv icon

Understanding Emergent Abilities of Language Models from the Loss Perspective

Add code
Bookmark button
Alert button
Mar 30, 2024
Zhengxiao Du, Aohan Zeng, Yuxiao Dong, Jie Tang

Viaarxiv icon

SciGLM: Training Scientific Language Models with Self-Reflective Instruction Annotation and Tuning

Add code
Bookmark button
Alert button
Jan 15, 2024
Dan Zhang, Ziniu Hu, Sining Zhoubian, Zhengxiao Du, Kaiyu Yang, Zihan Wang, Yisong Yue, Yuxiao Dong, Jie Tang

Viaarxiv icon

LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding

Add code
Bookmark button
Alert button
Aug 28, 2023
Yushi Bai, Xin Lv, Jiajie Zhang, Hongchang Lyu, Jiankai Tang, Zhidian Huang, Zhengxiao Du, Xiao Liu, Aohan Zeng, Lei Hou, Yuxiao Dong, Jie Tang, Juanzi Li

Figure 1 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Figure 2 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Figure 3 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Figure 4 for LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding
Viaarxiv icon

AgentBench: Evaluating LLMs as Agents

Add code
Bookmark button
Alert button
Aug 07, 2023
Xiao Liu, Hao Yu, Hanchen Zhang, Yifan Xu, Xuanyu Lei, Hanyu Lai, Yu Gu, Hangliang Ding, Kaiwen Men, Kejuan Yang, Shudan Zhang, Xiang Deng, Aohan Zeng, Zhengxiao Du, Chenhui Zhang, Sheng Shen, Tianjun Zhang, Yu Su, Huan Sun, Minlie Huang, Yuxiao Dong, Jie Tang

Figure 1 for AgentBench: Evaluating LLMs as Agents
Figure 2 for AgentBench: Evaluating LLMs as Agents
Figure 3 for AgentBench: Evaluating LLMs as Agents
Figure 4 for AgentBench: Evaluating LLMs as Agents
Viaarxiv icon

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences

Add code
Bookmark button
Alert button
Jun 13, 2023
Xiao Liu, Hanyu Lai, Hao Yu, Yifan Xu, Aohan Zeng, Zhengxiao Du, Peng Zhang, Yuxiao Dong, Jie Tang

Figure 1 for WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
Figure 2 for WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
Figure 3 for WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
Figure 4 for WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
Viaarxiv icon

GLM-130B: An Open Bilingual Pre-trained Model

Add code
Bookmark button
Alert button
Oct 05, 2022
Aohan Zeng, Xiao Liu, Zhengxiao Du, Zihan Wang, Hanyu Lai, Ming Ding, Zhuoyi Yang, Yifan Xu, Wendi Zheng, Xiao Xia, Weng Lam Tam, Zixuan Ma, Yufei Xue, Jidong Zhai, Wenguang Chen, Peng Zhang, Yuxiao Dong, Jie Tang

Figure 1 for GLM-130B: An Open Bilingual Pre-trained Model
Figure 2 for GLM-130B: An Open Bilingual Pre-trained Model
Figure 3 for GLM-130B: An Open Bilingual Pre-trained Model
Figure 4 for GLM-130B: An Open Bilingual Pre-trained Model
Viaarxiv icon

P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks

Add code
Bookmark button
Alert button
Oct 18, 2021
Xiao Liu, Kaixuan Ji, Yicheng Fu, Zhengxiao Du, Zhilin Yang, Jie Tang

Figure 1 for P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
Figure 2 for P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
Figure 3 for P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
Figure 4 for P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
Viaarxiv icon

GPT Understands, Too

Add code
Bookmark button
Alert button
Mar 18, 2021
Xiao Liu, Yanan Zheng, Zhengxiao Du, Ming Ding, Yujie Qian, Zhilin Yang, Jie Tang

Figure 1 for GPT Understands, Too
Figure 2 for GPT Understands, Too
Figure 3 for GPT Understands, Too
Figure 4 for GPT Understands, Too
Viaarxiv icon