Picture for Luke Zettlemoyer

Luke Zettlemoyer

University of Washington

Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models

Add code
Aug 05, 2022
Figure 1 for Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models
Figure 2 for Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models
Figure 3 for Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models
Figure 4 for Branch-Train-Merge: Embarrassingly Parallel Training of Expert Language Models
Viaarxiv icon

Questions Are All You Need to Train a Dense Passage Retriever

Add code
Jun 21, 2022
Figure 1 for Questions Are All You Need to Train a Dense Passage Retriever
Figure 2 for Questions Are All You Need to Train a Dense Passage Retriever
Figure 3 for Questions Are All You Need to Train a Dense Passage Retriever
Figure 4 for Questions Are All You Need to Train a Dense Passage Retriever
Viaarxiv icon

LegoNN: Building Modular Encoder-Decoder Models

Add code
Jun 07, 2022
Figure 1 for LegoNN: Building Modular Encoder-Decoder Models
Figure 2 for LegoNN: Building Modular Encoder-Decoder Models
Figure 3 for LegoNN: Building Modular Encoder-Decoder Models
Figure 4 for LegoNN: Building Modular Encoder-Decoder Models
Viaarxiv icon

Nearest Neighbor Zero-Shot Inference

Add code
May 27, 2022
Figure 1 for Nearest Neighbor Zero-Shot Inference
Figure 2 for Nearest Neighbor Zero-Shot Inference
Figure 3 for Nearest Neighbor Zero-Shot Inference
Figure 4 for Nearest Neighbor Zero-Shot Inference
Viaarxiv icon

Logical Satisfiability of Counterfactuals for Faithful Explanations in NLI

Add code
May 25, 2022
Figure 1 for Logical Satisfiability of Counterfactuals for Faithful Explanations in NLI
Figure 2 for Logical Satisfiability of Counterfactuals for Faithful Explanations in NLI
Figure 3 for Logical Satisfiability of Counterfactuals for Faithful Explanations in NLI
Figure 4 for Logical Satisfiability of Counterfactuals for Faithful Explanations in NLI
Viaarxiv icon

Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of Multilingual Language Models

Add code
May 24, 2022
Figure 1 for Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of Multilingual Language Models
Figure 2 for Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of Multilingual Language Models
Figure 3 for Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of Multilingual Language Models
Figure 4 for Analyzing the Mono- and Cross-Lingual Pretraining Dynamics of Multilingual Language Models
Viaarxiv icon

On the Role of Bidirectionality in Language Model Pre-Training

Add code
May 24, 2022
Figure 1 for On the Role of Bidirectionality in Language Model Pre-Training
Figure 2 for On the Role of Bidirectionality in Language Model Pre-Training
Figure 3 for On the Role of Bidirectionality in Language Model Pre-Training
Figure 4 for On the Role of Bidirectionality in Language Model Pre-Training
Viaarxiv icon

Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models

Add code
May 22, 2022
Figure 1 for Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models
Figure 2 for Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models
Figure 3 for Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models
Figure 4 for Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models
Viaarxiv icon

Few-shot Mining of Naturally Occurring Inputs and Outputs

Add code
May 09, 2022
Figure 1 for Few-shot Mining of Naturally Occurring Inputs and Outputs
Figure 2 for Few-shot Mining of Naturally Occurring Inputs and Outputs
Figure 3 for Few-shot Mining of Naturally Occurring Inputs and Outputs
Figure 4 for Few-shot Mining of Naturally Occurring Inputs and Outputs
Viaarxiv icon

OPT: Open Pre-trained Transformer Language Models

Add code
May 05, 2022
Figure 1 for OPT: Open Pre-trained Transformer Language Models
Figure 2 for OPT: Open Pre-trained Transformer Language Models
Figure 3 for OPT: Open Pre-trained Transformer Language Models
Figure 4 for OPT: Open Pre-trained Transformer Language Models
Viaarxiv icon