Alex Ray

Shammie

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Jun 10, 2022
Via arXiv

Training language models to follow instructions with human feedback

Mar 04, 2022
Via arXiv

Unsupervised Neural Machine Translation with Generative Language Models Only

Oct 11, 2021
Via arXiv

Evaluating Large Language Models Trained on Code

Jul 14, 2021
Via arXiv

Learning Dexterous In-Hand Manipulation

Jan 18, 2019
Via arXiv

Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

Mar 10, 2018
Via arXiv

Hindsight Experience Replay

Feb 23, 2018
Via arXiv

Domain Randomization for Transferring Deep Neural Networks from Simulation to the Real World

Mar 20, 2017
Via arXiv