Pengfei Liu

Can Large Language Models be Trusted for Evaluation? Scalable Meta-Evaluation of LLMs as Evaluators via Agent Debate

Jan 30, 2024
Steffi Chern, Ethan Chern, Graham Neubig, Pengfei Liu

Extending LLMs' Context Window with 100 Samples

Jan 13, 2024
Yikai Zhang, Junlong Li, Pengfei Liu

The Critique of Critique

Jan 09, 2024
Shichao Sun, Junlong Li, Weizhe Yuan, Ruifeng Yuan, Wenjie Li, Pengfei Liu

InFoBench: Evaluating Instruction Following Ability in Large Language Models

Jan 07, 2024
Yiwei Qin, Kaiqiang Song, Yebowen Hu, Wenlin Yao, Sangwoo Cho, Xiaoyang Wang, Xuansheng Wu, Fei Liu, Pengfei Liu, Dong Yu

Generative AI for Math: Part I -- MathPile: A Billion-Token-Scale Pretraining Corpus for Math

Dec 28, 2023
Zengzhi Wang, Rui Xia, Pengfei Liu

How Far Are We from Believable AI Agents? A Framework for Evaluating the Believability of Human Behavior Simulation

Dec 28, 2023
Yang Xiao, Yi Cheng, Jinlan Fu, Jiashuo Wang, Wenjie Li, Pengfei Liu

Align on the Fly: Adapting Chatbot Behavior to Established Norms

Dec 26, 2023
Chunpu Xu, Steffi Chern, Ethan Chern, Ge Zhang, Zekun Wang, Ruibo Liu, Jing Li, Jie Fu, Pengfei Liu

Alignment for Honesty

Dec 12, 2023
Yuqing Yang, Ethan Chern, Xipeng Qiu, Graham Neubig, Pengfei Liu

Benchmarking Generation and Evaluation Capabilities of Large Language Models for Instruction Controllable Summarization

Nov 15, 2023
Yixin Liu, Alexander R. Fabbri, Jiawen Chen, Yilun Zhao, Simeng Han, Shafiq Joty, Pengfei Liu, Dragomir Radev, Chien-Sheng Wu, Arman Cohan

LoBaSS: Gauging Learnability in Supervised Fine-tuning Data

Oct 16, 2023
Haotian Zhou, Tingkai Liu, Qianli Ma, Jianbo Yuan, Pengfei Liu, Yang You, Hongxia Yang
