Jakob Nicolaus Foerster

Multi-Agent Craftax: Benchmarking Open-Ended Multi-Agent Reinforcement Learning at the Hyperscale

Nov 07, 2025
Via arXiv

Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents

Sep 03, 2025
Via arXiv

How Should We Meta-Learn Reinforcement Learning Algorithms?

Jul 23, 2025
Via arXiv

AI Research Agents for Machine Learning: Search, Exploration, and Generalization in MLE-bench

Jul 03, 2025
Via arXiv

Ad-Hoc Human-AI Coordination Challenge

Jun 26, 2025
Via arXiv

A Clean Slate for Offline Reinforcement Learning

Apr 15, 2025
Via arXiv

AgentBreeder: Mitigating the AI Safety Impact of Multi-Agent Scaffolds

Feb 02, 2025
Via arXiv

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Nov 20, 2024
Via arXiv

Learning Loss Landscapes in Preference Optimization

Nov 10, 2024
Via arXiv

Can Learned Optimization Make Reinforcement Learning Less Difficult?

Jul 09, 2024
Via arXiv