Alert button
Picture for Yaobo Liang

Yaobo Liang

Alert button

PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion

Add code
Bookmark button
Alert button
Mar 06, 2024
Zekai Zhang, Yiduo Guo, Yaobo Liang, Dongyan Zhao, Nan Duan

Figure 1 for PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion
Figure 2 for PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion
Figure 3 for PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion
Figure 4 for PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion
Viaarxiv icon

Competition-Level Problems are Effective LLM Evaluators

Add code
Bookmark button
Alert button
Dec 05, 2023
Yiming Huang, Zhenghao Lin, Xiao Liu, Yeyun Gong, Shuai Lu, Fangyu Lei, Yaobo Liang, Yelong Shen, Chen Lin, Nan Duan, Weizhu Chen

Figure 1 for Competition-Level Problems are Effective LLM Evaluators
Figure 2 for Competition-Level Problems are Effective LLM Evaluators
Figure 3 for Competition-Level Problems are Effective LLM Evaluators
Figure 4 for Competition-Level Problems are Effective LLM Evaluators
Viaarxiv icon

PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion

Add code
Bookmark button
Alert button
Nov 07, 2023
Yiduo Guo, Zekai Zhang, Yaobo Liang, Dongyan Zhao, Nan Duan

Figure 1 for PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
Figure 2 for PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
Figure 3 for PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
Figure 4 for PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
Viaarxiv icon

EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation

Add code
Bookmark button
Alert button
Oct 12, 2023
Wang You, Wenshan Wu, Yaobo Liang, Shaoguang Mao, Chenfei Wu, Maosong Cao, Yuzhe Cai, Yiduo Guo, Yan Xia, Furu Wei, Nan Duan

Figure 1 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Figure 2 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Figure 3 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Figure 4 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Viaarxiv icon

GameEval: Evaluating LLMs on Conversational Games

Add code
Bookmark button
Alert button
Aug 19, 2023
Dan Qiao, Chenfei Wu, Yaobo Liang, Juntao Li, Nan Duan

Figure 1 for GameEval: Evaluating LLMs on Conversational Games
Figure 2 for GameEval: Evaluating LLMs on Conversational Games
Figure 3 for GameEval: Evaluating LLMs on Conversational Games
Figure 4 for GameEval: Evaluating LLMs on Conversational Games
Viaarxiv icon

Machine-Created Universal Language for Cross-lingual Transfer

Add code
Bookmark button
Alert button
May 22, 2023
Yaobo Liang, Quanzhi Zhu, Junhe Zhao, Nan Duan

Figure 1 for Machine-Created Universal Language for Cross-lingual Transfer
Figure 2 for Machine-Created Universal Language for Cross-lingual Transfer
Figure 3 for Machine-Created Universal Language for Cross-lingual Transfer
Figure 4 for Machine-Created Universal Language for Cross-lingual Transfer
Viaarxiv icon

Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast

Add code
Bookmark button
Alert button
May 19, 2023
Yiduo Guo, Yaobo Liang, Dongyan Zhao, Bing Liu, Duan Nan

Figure 1 for Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast
Figure 2 for Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast
Figure 3 for Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast
Figure 4 for Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast
Viaarxiv icon

Learning to Program with Natural Language

Add code
Bookmark button
Alert button
Apr 23, 2023
Yiduo Guo, Yaobo Liang, Chenfei Wu, Wenshan Wu, Dongyan Zhao, Nan Duan

Figure 1 for Learning to Program with Natural Language
Figure 2 for Learning to Program with Natural Language
Figure 3 for Learning to Program with Natural Language
Figure 4 for Learning to Program with Natural Language
Viaarxiv icon