Zaid Alyafeai

Taqyim: Evaluating Arabic NLP Tasks Using ChatGPT Models

Jun 28, 2023
Via arXiv

The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset

Mar 07, 2023
Via arXiv

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Nov 09, 2022
Via arXiv

Crosslingual Generalization through Multitask Finetuning

Nov 03, 2022
Via arXiv

Masader Plus: A New Interface for Exploring +500 Arabic NLP Datasets

Aug 01, 2022
Via arXiv

PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts

Feb 02, 2022
Via arXiv

Documenting Geographically and Contextually Diverse Data Sources: The BigScience Catalogue of Language Data and Resources

Jan 25, 2022
Via arXiv

Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP

Dec 20, 2021
Via arXiv

Multitask Prompted Training Enables Zero-Shot Task Generalization

Oct 15, 2021
Via arXiv

Masader: Metadata Sourcing for Arabic Text and Speech Data Resources

Oct 13, 2021
Via arXiv