Picture for M Saiful Bari

M Saiful Bari

When Benchmarks are Targets: Revealing the Sensitivity of Large Language Model Leaderboards

Add code
Feb 01, 2024
Viaarxiv icon

BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP

Add code
Sep 22, 2023
Figure 1 for BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP
Figure 2 for BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP
Figure 3 for BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP
Figure 4 for BenLLMEval: A Comprehensive Evaluation into the Potentials and Pitfalls of Large Language Models on Bengali NLP
Viaarxiv icon

A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets

Add code
Jun 08, 2023
Figure 1 for A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Figure 2 for A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Figure 3 for A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Figure 4 for A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets
Viaarxiv icon

xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval

Add code
Mar 06, 2023
Figure 1 for xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
Figure 2 for xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
Figure 3 for xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
Figure 4 for xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval
Viaarxiv icon

SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning

Add code
Dec 21, 2022
Figure 1 for SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
Figure 2 for SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
Figure 3 for SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
Figure 4 for SPT: Semi-Parametric Prompt Tuning for Multitask Prompted Learning
Viaarxiv icon

BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting

Add code
Dec 19, 2022
Figure 1 for BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Figure 2 for BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Figure 3 for BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Figure 4 for BLOOM+1: Adding Language Support to BLOOM for Zero-Shot Prompting
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

What Language Model to Train if You Have One Million GPU Hours?

Add code
Nov 08, 2022
Figure 1 for What Language Model to Train if You Have One Million GPU Hours?
Figure 2 for What Language Model to Train if You Have One Million GPU Hours?
Figure 3 for What Language Model to Train if You Have One Million GPU Hours?
Figure 4 for What Language Model to Train if You Have One Million GPU Hours?
Viaarxiv icon

Crosslingual Generalization through Multitask Finetuning

Add code
Nov 03, 2022
Viaarxiv icon

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts

Add code
Feb 02, 2022
Figure 1 for PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Figure 2 for PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Figure 3 for PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Figure 4 for PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Viaarxiv icon