Picture for Yiduo Guo

Yiduo Guo

PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion

Add code
Mar 06, 2024
Figure 1 for PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion
Figure 2 for PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion
Figure 3 for PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion
Figure 4 for PPTC-R benchmark: Towards Evaluating the Robustness of Large Language Models for PowerPoint Task Completion
Viaarxiv icon

PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion

Add code
Nov 07, 2023
Figure 1 for PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
Figure 2 for PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
Figure 3 for PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
Figure 4 for PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion
Viaarxiv icon

EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation

Oct 12, 2023
Figure 1 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Figure 2 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Figure 3 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Figure 4 for EIPE-text: Evaluation-Guided Iterative Plan Extraction for Long-Form Narrative Text Generation
Viaarxiv icon

Class Incremental Learning via Likelihood Ratio Based Task Prediction

Add code
Oct 01, 2023
Figure 1 for Class Incremental Learning via Likelihood Ratio Based Task Prediction
Figure 2 for Class Incremental Learning via Likelihood Ratio Based Task Prediction
Figure 3 for Class Incremental Learning via Likelihood Ratio Based Task Prediction
Figure 4 for Class Incremental Learning via Likelihood Ratio Based Task Prediction
Viaarxiv icon

Class-Incremental Learning based on Label Generation

Add code
Jun 22, 2023
Figure 1 for Class-Incremental Learning based on Label Generation
Figure 2 for Class-Incremental Learning based on Label Generation
Figure 3 for Class-Incremental Learning based on Label Generation
Figure 4 for Class-Incremental Learning based on Label Generation
Viaarxiv icon

Dealing with Cross-Task Class Discrimination in Online Continual Learning

Add code
May 24, 2023
Figure 1 for Dealing with Cross-Task Class Discrimination in Online Continual Learning
Figure 2 for Dealing with Cross-Task Class Discrimination in Online Continual Learning
Figure 3 for Dealing with Cross-Task Class Discrimination in Online Continual Learning
Figure 4 for Dealing with Cross-Task Class Discrimination in Online Continual Learning
Viaarxiv icon

Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast

May 19, 2023
Figure 1 for Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast
Figure 2 for Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast
Figure 3 for Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast
Figure 4 for Analyzing and Reducing the Performance Gap in Cross-Lingual Transfer with Fine-tuning Slow and Fast
Viaarxiv icon

Learning to Program with Natural Language

Add code
Apr 23, 2023
Figure 1 for Learning to Program with Natural Language
Figure 2 for Learning to Program with Natural Language
Figure 3 for Learning to Program with Natural Language
Figure 4 for Learning to Program with Natural Language
Viaarxiv icon

AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models

Add code
Apr 13, 2023
Figure 1 for AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models
Figure 2 for AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models
Figure 3 for AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models
Figure 4 for AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models
Viaarxiv icon