Colin Raffel

Shammie

ColD Fusion: Collaborative Descent for Distributed Multitask Finetuning

Dec 02, 2022
Via arXiv

Evaluating the Factual Consistency of Large Language Models Through Summarization

Nov 15, 2022
Via arXiv

Large Language Models Struggle to Learn Long-Tail Knowledge

Nov 15, 2022
Via arXiv

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Nov 09, 2022
Via arXiv

What Language Model to Train if You Have One Million GPU Hours?

Nov 08, 2022
Via arXiv

Crosslingual Generalization through Multitask Finetuning

Nov 03, 2022
Via arXiv

Compositional Generalization in Unsupervised Compositional Representation Learning: A Study on Disentanglement and Emergent Language

Oct 05, 2022
Via arXiv

A Combinatorial Perspective on the Optimization of Shallow ReLU Networks

Oct 01, 2022
Via arXiv

Bidirectional Language Models Are Also Few-shot Learners

Sep 29, 2022
Via arXiv

Petals: Collaborative Inference and Fine-tuning of Large Models

Sep 02, 2022
Via arXiv