Filippos Christianos

Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning

Dec 22, 2023
Via arXiv

Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models

Oct 27, 2023
Via arXiv

Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks

Sep 28, 2023
Via arXiv

SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning

May 09, 2023
Via arXiv

Revisiting the Gumbel-Softmax in MADDPG

Feb 23, 2023
Via arXiv

Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models

Oct 26, 2022
Via arXiv

Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning

Sep 28, 2022
Via arXiv

Deep Reinforcement Learning for Multi-Agent Interaction

Aug 02, 2022
Via arXiv

Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning

Jul 05, 2022
Via arXiv

Decoupling Exploration and Exploitation in Reinforcement Learning

Jul 22, 2021
Via arXiv