Picture for Sharan Narang

Sharan Narang

Jack

UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining

Add code
Apr 18, 2023
Figure 1 for UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Figure 2 for UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Figure 3 for UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Figure 4 for UniMax: Fairer and more Effective Language Sampling for Large-Scale Multilingual Pretraining
Viaarxiv icon

Character-Aware Models Improve Visual Text Rendering

Add code
Dec 20, 2022
Figure 1 for Character-Aware Models Improve Visual Text Rendering
Figure 2 for Character-Aware Models Improve Visual Text Rendering
Figure 3 for Character-Aware Models Improve Visual Text Rendering
Figure 4 for Character-Aware Models Improve Visual Text Rendering
Viaarxiv icon

FCM: Forgetful Causal Masking Makes Causal Language Models Better Zero-Shot Learners

Add code
Oct 24, 2022
Viaarxiv icon

Scaling Instruction-Finetuned Language Models

Add code
Oct 20, 2022
Figure 1 for Scaling Instruction-Finetuned Language Models
Figure 2 for Scaling Instruction-Finetuned Language Models
Figure 3 for Scaling Instruction-Finetuned Language Models
Figure 4 for Scaling Instruction-Finetuned Language Models
Viaarxiv icon

Understanding HTML with Large Language Models

Add code
Oct 08, 2022
Figure 1 for Understanding HTML with Large Language Models
Figure 2 for Understanding HTML with Large Language Models
Figure 3 for Understanding HTML with Large Language Models
Figure 4 for Understanding HTML with Large Language Models
Viaarxiv icon

Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?

Add code
Jul 21, 2022
Figure 1 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Figure 2 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Figure 3 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Figure 4 for Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Viaarxiv icon

PaLM: Scaling Language Modeling with Pathways

Add code
Apr 19, 2022
Figure 1 for PaLM: Scaling Language Modeling with Pathways
Figure 2 for PaLM: Scaling Language Modeling with Pathways
Figure 3 for PaLM: Scaling Language Modeling with Pathways
Figure 4 for PaLM: Scaling Language Modeling with Pathways
Viaarxiv icon

Self-Consistency Improves Chain of Thought Reasoning in Language Models

Add code
Apr 06, 2022
Figure 1 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Figure 2 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Figure 3 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Figure 4 for Self-Consistency Improves Chain of Thought Reasoning in Language Models
Viaarxiv icon

Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$

Add code
Mar 31, 2022
Figure 1 for Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$
Figure 2 for Scaling Up Models and Data with $\texttt{t5x}$ and $\texttt{seqio}$
Viaarxiv icon

Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers

Add code
Sep 22, 2021
Figure 1 for Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Figure 2 for Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Figure 3 for Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Figure 4 for Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
Viaarxiv icon