Picture for David Dohan

David Dohan

Shammie

Training Chain-of-Thought via Latent-Variable Inference

Add code
Nov 28, 2023
Figure 1 for Training Chain-of-Thought via Latent-Variable Inference
Figure 2 for Training Chain-of-Thought via Latent-Variable Inference
Figure 3 for Training Chain-of-Thought via Latent-Variable Inference
Figure 4 for Training Chain-of-Thought via Latent-Variable Inference
Viaarxiv icon

Large Language Models Can Be Easily Distracted by Irrelevant Context

Add code
Feb 13, 2023
Figure 1 for Large Language Models Can Be Easily Distracted by Irrelevant Context
Figure 2 for Large Language Models Can Be Easily Distracted by Irrelevant Context
Figure 3 for Large Language Models Can Be Easily Distracted by Irrelevant Context
Figure 4 for Large Language Models Can Be Easily Distracted by Irrelevant Context
Viaarxiv icon

Language Model Cascades

Add code
Jul 28, 2022
Figure 1 for Language Model Cascades
Figure 2 for Language Model Cascades
Figure 3 for Language Model Cascades
Figure 4 for Language Model Cascades
Viaarxiv icon

Solving Quantitative Reasoning Problems with Language Models

Add code
Jul 01, 2022
Figure 1 for Solving Quantitative Reasoning Problems with Language Models
Figure 2 for Solving Quantitative Reasoning Problems with Language Models
Figure 3 for Solving Quantitative Reasoning Problems with Language Models
Figure 4 for Solving Quantitative Reasoning Problems with Language Models
Viaarxiv icon

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Add code
Jun 10, 2022
Viaarxiv icon

Towards Learning Universal Hyperparameter Optimizers with Transformers

Add code
May 26, 2022
Figure 1 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Figure 2 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Figure 3 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Figure 4 for Towards Learning Universal Hyperparameter Optimizers with Transformers
Viaarxiv icon

PaLM: Scaling Language Modeling with Pathways

Add code
Apr 19, 2022
Figure 1 for PaLM: Scaling Language Modeling with Pathways
Figure 2 for PaLM: Scaling Language Modeling with Pathways
Figure 3 for PaLM: Scaling Language Modeling with Pathways
Figure 4 for PaLM: Scaling Language Modeling with Pathways
Viaarxiv icon

Show Your Work: Scratchpads for Intermediate Computation with Language Models

Add code
Nov 30, 2021
Figure 1 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Figure 2 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Figure 3 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Figure 4 for Show Your Work: Scratchpads for Intermediate Computation with Language Models
Viaarxiv icon

Program Synthesis with Large Language Models

Add code
Aug 16, 2021
Figure 1 for Program Synthesis with Large Language Models
Figure 2 for Program Synthesis with Large Language Models
Figure 3 for Program Synthesis with Large Language Models
Figure 4 for Program Synthesis with Large Language Models
Viaarxiv icon

Latent Programmer: Discrete Latent Codes for Program Synthesis

Add code
Dec 01, 2020
Figure 1 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Figure 2 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Figure 3 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Figure 4 for Latent Programmer: Discrete Latent Codes for Program Synthesis
Viaarxiv icon