Picture for Jakob N. Foerster

Jakob N. Foerster

University of Oxford

Intent Factored Generation: Unleashing the Diversity in Your Language Model

Add code
Jun 11, 2025
Viaarxiv icon

High Accuracy, Less Talk (HALT): Reliable LLMs through Capability-Aligned Finetuning

Add code
Jun 04, 2025
Viaarxiv icon

SOReL and TOReL: Two Methods for Fully Offline Reinforcement Learning

Add code
May 28, 2025
Viaarxiv icon

An Optimisation Framework for Unsupervised Environment Design

Add code
May 27, 2025
Viaarxiv icon

Beyond the Boundaries of Proximal Policy Optimization

Add code
Nov 01, 2024
Figure 1 for Beyond the Boundaries of Proximal Policy Optimization
Figure 2 for Beyond the Boundaries of Proximal Policy Optimization
Figure 3 for Beyond the Boundaries of Proximal Policy Optimization
Figure 4 for Beyond the Boundaries of Proximal Policy Optimization
Viaarxiv icon

Opponent Shaping for Antibody Development

Add code
Sep 19, 2024
Viaarxiv icon

Discovering Minimal Reinforcement Learning Environments

Add code
Jun 18, 2024
Viaarxiv icon

HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits

Add code
Jun 05, 2024
Figure 1 for HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits
Figure 2 for HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits
Figure 3 for HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits
Figure 4 for HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits
Viaarxiv icon

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

Add code
Dec 14, 2022
Figure 1 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 2 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 3 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 4 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

Game-Theoretical Perspectives on Active Equilibria: A Preferred Solution Concept over Nash Equilibria

Add code
Oct 28, 2022
Viaarxiv icon