Lilian Weng

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Apr 19, 2024
Via arXiv

A Holistic Approach to Undesired Content Detection in the Real World

Aug 05, 2022
Via arXiv

Exploration in Deep Reinforcement Learning: A Survey

May 02, 2022
Via arXiv

Text and Code Embeddings by Contrastive Pre-Training

Jan 24, 2022
Via arXiv

Asymmetric self-play for automatic goal discovery in robotic manipulation

Jan 13, 2021
Via arXiv

Automatic Curriculum Learning For Deep RL: A Short Survey

Mar 10, 2020
Via arXiv

Solving Rubik's Cube with a Robot Hand

Oct 16, 2019
Via arXiv

ORRB -- OpenAI Remote Rendering Backend

Jun 26, 2019
Via arXiv

From GAN to WGAN

Apr 18, 2019
Via arXiv

Learning Dexterous In-Hand Manipulation

Jan 18, 2019
Via arXiv