Alert button
Picture for Joshua Achiam

Joshua Achiam

Alert button

A Hazard Analysis Framework for Code Synthesis Large Language Models

Add code
Bookmark button
Alert button
Jul 25, 2022
Heidy Khlaaf, Pamela Mishkin, Joshua Achiam, Gretchen Krueger, Miles Brundage

Figure 1 for A Hazard Analysis Framework for Code Synthesis Large Language Models
Figure 2 for A Hazard Analysis Framework for Code Synthesis Large Language Models
Figure 3 for A Hazard Analysis Framework for Code Synthesis Large Language Models
Viaarxiv icon

Responsive Safety in Reinforcement Learning by PID Lagrangian Methods

Add code
Bookmark button
Alert button
Jul 08, 2020
Adam Stooke, Joshua Achiam, Pieter Abbeel

Figure 1 for Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Figure 2 for Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Figure 3 for Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Figure 4 for Responsive Safety in Reinforcement Learning by PID Lagrangian Methods
Viaarxiv icon

Towards Characterizing Divergence in Deep Q-Learning

Add code
Bookmark button
Alert button
Mar 21, 2019
Joshua Achiam, Ethan Knight, Pieter Abbeel

Figure 1 for Towards Characterizing Divergence in Deep Q-Learning
Figure 2 for Towards Characterizing Divergence in Deep Q-Learning
Figure 3 for Towards Characterizing Divergence in Deep Q-Learning
Figure 4 for Towards Characterizing Divergence in Deep Q-Learning
Viaarxiv icon

On First-Order Meta-Learning Algorithms

Add code
Bookmark button
Alert button
Oct 22, 2018
Alex Nichol, Joshua Achiam, John Schulman

Figure 1 for On First-Order Meta-Learning Algorithms
Figure 2 for On First-Order Meta-Learning Algorithms
Figure 3 for On First-Order Meta-Learning Algorithms
Figure 4 for On First-Order Meta-Learning Algorithms
Viaarxiv icon

Variational Option Discovery Algorithms

Add code
Bookmark button
Alert button
Jul 26, 2018
Joshua Achiam, Harrison Edwards, Dario Amodei, Pieter Abbeel

Figure 1 for Variational Option Discovery Algorithms
Figure 2 for Variational Option Discovery Algorithms
Figure 3 for Variational Option Discovery Algorithms
Figure 4 for Variational Option Discovery Algorithms
Viaarxiv icon

Constrained Policy Optimization

Add code
Bookmark button
Alert button
May 30, 2017
Joshua Achiam, David Held, Aviv Tamar, Pieter Abbeel

Figure 1 for Constrained Policy Optimization
Viaarxiv icon

Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 06, 2017
Joshua Achiam, Shankar Sastry

Figure 1 for Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Figure 2 for Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Figure 3 for Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Figure 4 for Surprise-Based Intrinsic Motivation for Deep Reinforcement Learning
Viaarxiv icon

Easy Monotonic Policy Iteration

Add code
Bookmark button
Alert button
Feb 29, 2016
Joshua Achiam

Viaarxiv icon