Picture for Tom Henighan

Tom Henighan

Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Add code
Apr 12, 2022
Figure 1 for Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Figure 2 for Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Figure 3 for Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Figure 4 for Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Viaarxiv icon

A General Language Assistant as a Laboratory for Alignment

Add code
Dec 09, 2021
Figure 1 for A General Language Assistant as a Laboratory for Alignment
Figure 2 for A General Language Assistant as a Laboratory for Alignment
Figure 3 for A General Language Assistant as a Laboratory for Alignment
Figure 4 for A General Language Assistant as a Laboratory for Alignment
Viaarxiv icon

Scaling Laws for Transfer

Add code
Feb 02, 2021
Figure 1 for Scaling Laws for Transfer
Figure 2 for Scaling Laws for Transfer
Figure 3 for Scaling Laws for Transfer
Figure 4 for Scaling Laws for Transfer
Viaarxiv icon

Scaling Laws for Autoregressive Generative Modeling

Add code
Nov 06, 2020
Figure 1 for Scaling Laws for Autoregressive Generative Modeling
Figure 2 for Scaling Laws for Autoregressive Generative Modeling
Figure 3 for Scaling Laws for Autoregressive Generative Modeling
Figure 4 for Scaling Laws for Autoregressive Generative Modeling
Viaarxiv icon

Language Models are Few-Shot Learners

Add code
Jun 05, 2020
Figure 1 for Language Models are Few-Shot Learners
Figure 2 for Language Models are Few-Shot Learners
Figure 3 for Language Models are Few-Shot Learners
Figure 4 for Language Models are Few-Shot Learners
Viaarxiv icon

Scaling Laws for Neural Language Models

Add code
Jan 23, 2020
Figure 1 for Scaling Laws for Neural Language Models
Figure 2 for Scaling Laws for Neural Language Models
Figure 3 for Scaling Laws for Neural Language Models
Figure 4 for Scaling Laws for Neural Language Models
Viaarxiv icon