Picture for Edward Grefenstette

Edward Grefenstette

Scaling Opponent Shaping to High Dimensional Games

Add code
Dec 19, 2023
Figure 1 for Scaling Opponent Shaping to High Dimensional Games
Figure 2 for Scaling Opponent Shaping to High Dimensional Games
Figure 3 for Scaling Opponent Shaping to High Dimensional Games
Figure 4 for Scaling Opponent Shaping to High Dimensional Games
Viaarxiv icon

H-GAP: Humanoid Control with a Generalist Planner

Add code
Dec 05, 2023
Figure 1 for H-GAP: Humanoid Control with a Generalist Planner
Figure 2 for H-GAP: Humanoid Control with a Generalist Planner
Figure 3 for H-GAP: Humanoid Control with a Generalist Planner
Figure 4 for H-GAP: Humanoid Control with a Generalist Planner
Viaarxiv icon

minimax: Efficient Baselines for Autocurricula in JAX

Add code
Nov 23, 2023
Figure 1 for minimax: Efficient Baselines for Autocurricula in JAX
Figure 2 for minimax: Efficient Baselines for Autocurricula in JAX
Figure 3 for minimax: Efficient Baselines for Autocurricula in JAX
Figure 4 for minimax: Efficient Baselines for Autocurricula in JAX
Viaarxiv icon

Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks

Add code
Nov 21, 2023
Viaarxiv icon

Understanding the Effects of RLHF on LLM Generalisation and Diversity

Add code
Oct 10, 2023
Viaarxiv icon

Finetuning from Offline Reinforcement Learning: Challenges, Trade-offs and Practical Solutions

Add code
Mar 30, 2023
Viaarxiv icon

Optimal Transport for Offline Imitation Learning

Add code
Mar 24, 2023
Figure 1 for Optimal Transport for Offline Imitation Learning
Figure 2 for Optimal Transport for Offline Imitation Learning
Figure 3 for Optimal Transport for Offline Imitation Learning
Figure 4 for Optimal Transport for Offline Imitation Learning
Viaarxiv icon

General Intelligence Requires Rethinking Exploration

Add code
Nov 15, 2022
Viaarxiv icon

Large language models are not zero-shot communicators

Add code
Oct 26, 2022
Viaarxiv icon

Learning General World Models in a Handful of Reward-Free Deployments

Add code
Oct 23, 2022
Figure 1 for Learning General World Models in a Handful of Reward-Free Deployments
Figure 2 for Learning General World Models in a Handful of Reward-Free Deployments
Figure 3 for Learning General World Models in a Handful of Reward-Free Deployments
Figure 4 for Learning General World Models in a Handful of Reward-Free Deployments
Viaarxiv icon