Picture for Francis Song

Francis Song

Solving math word problems with process- and outcome-based feedback

Add code
Nov 25, 2022
Figure 1 for Solving math word problems with process- and outcome-based feedback
Figure 2 for Solving math word problems with process- and outcome-based feedback
Figure 3 for Solving math word problems with process- and outcome-based feedback
Figure 4 for Solving math word problems with process- and outcome-based feedback
Viaarxiv icon

Teaching language models to support answers with verified quotes

Add code
Mar 21, 2022
Figure 1 for Teaching language models to support answers with verified quotes
Figure 2 for Teaching language models to support answers with verified quotes
Figure 3 for Teaching language models to support answers with verified quotes
Figure 4 for Teaching language models to support answers with verified quotes
Viaarxiv icon

Red Teaming Language Models with Language Models

Add code
Feb 07, 2022
Figure 1 for Red Teaming Language Models with Language Models
Figure 2 for Red Teaming Language Models with Language Models
Figure 3 for Red Teaming Language Models with Language Models
Figure 4 for Red Teaming Language Models with Language Models
Viaarxiv icon

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

Add code
Dec 08, 2021
Figure 1 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 2 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 3 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Figure 4 for Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Viaarxiv icon

Synthetic Returns for Long-Term Credit Assignment

Add code
Feb 24, 2021
Figure 1 for Synthetic Returns for Long-Term Credit Assignment
Figure 2 for Synthetic Returns for Long-Term Credit Assignment
Figure 3 for Synthetic Returns for Long-Term Credit Assignment
Figure 4 for Synthetic Returns for Long-Term Credit Assignment
Viaarxiv icon

Alchemy: A structured task distribution for meta-reinforcement learning

Add code
Feb 04, 2021
Figure 1 for Alchemy: A structured task distribution for meta-reinforcement learning
Figure 2 for Alchemy: A structured task distribution for meta-reinforcement learning
Figure 3 for Alchemy: A structured task distribution for meta-reinforcement learning
Figure 4 for Alchemy: A structured task distribution for meta-reinforcement learning
Viaarxiv icon

Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning

Add code
Nov 04, 2018
Figure 1 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Figure 2 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Figure 3 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Figure 4 for Bayesian Action Decoder for Deep Multi-Agent Reinforcement Learning
Viaarxiv icon

Relational inductive biases, deep learning, and graph networks

Add code
Oct 17, 2018
Figure 1 for Relational inductive biases, deep learning, and graph networks
Figure 2 for Relational inductive biases, deep learning, and graph networks
Figure 3 for Relational inductive biases, deep learning, and graph networks
Figure 4 for Relational inductive biases, deep learning, and graph networks
Viaarxiv icon