Picture for Denny Zhou

Denny Zhou

Flan-MoE: Scaling Instruction-Finetuned Language Models with Sparse Mixture of Experts

Add code
May 24, 2023
Viaarxiv icon

A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity

Add code
May 22, 2023
Figure 1 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Figure 2 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Figure 3 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Figure 4 for A Pretrainer's Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
Viaarxiv icon

Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization

Add code
May 19, 2023
Figure 1 for Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization
Figure 2 for Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization
Figure 3 for Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization
Figure 4 for Not All Semantics are Created Equal: Contrastive Self-supervised Learning with Automatic Temperature Individualization
Viaarxiv icon

PaLM 2 Technical Report

Add code
May 17, 2023
Figure 1 for PaLM 2 Technical Report
Figure 2 for PaLM 2 Technical Report
Figure 3 for PaLM 2 Technical Report
Figure 4 for PaLM 2 Technical Report
Viaarxiv icon

Symbol tuning improves in-context learning in language models

Add code
May 15, 2023
Figure 1 for Symbol tuning improves in-context learning in language models
Figure 2 for Symbol tuning improves in-context learning in language models
Figure 3 for Symbol tuning improves in-context learning in language models
Figure 4 for Symbol tuning improves in-context learning in language models
Viaarxiv icon

Teaching Large Language Models to Self-Debug

Add code
Apr 11, 2023
Figure 1 for Teaching Large Language Models to Self-Debug
Figure 2 for Teaching Large Language Models to Self-Debug
Figure 3 for Teaching Large Language Models to Self-Debug
Figure 4 for Teaching Large Language Models to Self-Debug
Viaarxiv icon

Larger language models do in-context learning differently

Add code
Mar 08, 2023
Figure 1 for Larger language models do in-context learning differently
Figure 2 for Larger language models do in-context learning differently
Figure 3 for Larger language models do in-context learning differently
Figure 4 for Larger language models do in-context learning differently
Viaarxiv icon

The Flan Collection: Designing Data and Methods for Effective Instruction Tuning

Add code
Feb 14, 2023
Figure 1 for The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Figure 2 for The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Figure 3 for The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Figure 4 for The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Viaarxiv icon

Large Language Models Can Be Easily Distracted by Irrelevant Context

Add code
Feb 13, 2023
Figure 1 for Large Language Models Can Be Easily Distracted by Irrelevant Context
Figure 2 for Large Language Models Can Be Easily Distracted by Irrelevant Context
Figure 3 for Large Language Models Can Be Easily Distracted by Irrelevant Context
Figure 4 for Large Language Models Can Be Easily Distracted by Irrelevant Context
Viaarxiv icon

What learning algorithm is in-context learning? Investigations with linear models

Add code
Nov 29, 2022
Viaarxiv icon