Filippos Christianos

Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning

Dec 22, 2023
Filippos Christianos, Georgios Papoudakis, Matthieu Zimmer, Thomas Coste, Zhihao Wu, Jingxuan Chen, Khyati Khandelwal, James Doran, Xidong Feng, Jiacheng Liu, Zheng Xiong, Yicheng Luo, Jianye Hao, Kun Shao, Haitham Bou-Ammar, Jun Wang

Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models

Oct 27, 2023
Xue Yan, Yan Song, Xinyu Cui, Filippos Christianos, Haifeng Zhang, David Henry Mguni, Jun Wang

Intrinsic Language-Guided Exploration for Complex Long-Horizon Robotic Manipulation Tasks

Sep 28, 2023
Eleftherios Triantafyllidis, Filippos Christianos, Zhibin Li

SMAClite: A Lightweight Environment for Multi-Agent Reinforcement Learning

May 09, 2023
Adam Michalski, Filippos Christianos, Stefano V. Albrecht

Revisiting the Gumbel-Softmax in MADDPG

Feb 23, 2023
Callum Rhys Tilbury, Filippos Christianos, Stefano V. Albrecht

Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models

Oct 26, 2022
Filippos Christianos, Peter Karkus, Boris Ivanovic, Stefano V. Albrecht, Marco Pavone

Pareto Actor-Critic for Equilibrium Selection in Multi-Agent Reinforcement Learning

Sep 28, 2022
Filippos Christianos, Georgios Papoudakis, Stefano V. Albrecht

Deep Reinforcement Learning for Multi-Agent Interaction

Aug 02, 2022
Ibrahim H. Ahmed, Cillian Brewitt, Ignacio Carlucho, Filippos Christianos, Mhairi Dunion, Elliot Fosong, Samuel Garcin, Shangmin Guo, Balint Gyevnar, Trevor McInroe, Georgios Papoudakis, Arrasy Rahman, Lukas Schäfer, Massimiliano Tamborski, Giuseppe Vecchio, Cheng Wang, Stefano V. Albrecht

Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning

Jul 05, 2022
Lukas Schäfer, Filippos Christianos, Amos Storkey, Stefano V. Albrecht

Decoupling Exploration and Exploitation in Reinforcement Learning

Jul 22, 2021
Lukas Schäfer, Filippos Christianos, Josiah Hanna, Stefano V. Albrecht
