Alert button
Picture for Stefano V. Albrecht

Stefano V. Albrecht

Alert button

Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 02, 2023
Lukas Schäfer, Oliver Slumbers, Stephen McAleer, Yali Du, Stefano V. Albrecht, David Mguni

Figure 1 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Figure 2 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Figure 3 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Figure 4 for Ensemble Value Functions for Efficient Exploration in Multi-Agent Reinforcement Learning
Viaarxiv icon

Revisiting the Gumbel-Softmax in MADDPG

Add code
Bookmark button
Alert button
Feb 23, 2023
Callum Rhys Tilbury, Filippos Christianos, Stefano V. Albrecht

Figure 1 for Revisiting the Gumbel-Softmax in MADDPG
Figure 2 for Revisiting the Gumbel-Softmax in MADDPG
Figure 3 for Revisiting the Gumbel-Softmax in MADDPG
Figure 4 for Revisiting the Gumbel-Softmax in MADDPG
Viaarxiv icon

Causal Social Explanations for Stochastic Sequential Multi-Agent Decision-Making

Add code
Bookmark button
Alert button
Feb 21, 2023
Balint Gyevnar, Cheng Wang, Christopher G. Lucas, Shay B. Cohen, Stefano V. Albrecht

Figure 1 for Causal Social Explanations for Stochastic Sequential Multi-Agent Decision-Making
Figure 2 for Causal Social Explanations for Stochastic Sequential Multi-Agent Decision-Making
Figure 3 for Causal Social Explanations for Stochastic Sequential Multi-Agent Decision-Making
Figure 4 for Causal Social Explanations for Stochastic Sequential Multi-Agent Decision-Making
Viaarxiv icon

Learning Complex Teamwork Tasks using a Sub-task Curriculum

Add code
Bookmark button
Alert button
Feb 09, 2023
Elliot Fosong, Arrasy Rahman, Ignacio Carlucho, Stefano V. Albrecht

Figure 1 for Learning Complex Teamwork Tasks using a Sub-task Curriculum
Figure 2 for Learning Complex Teamwork Tasks using a Sub-task Curriculum
Figure 3 for Learning Complex Teamwork Tasks using a Sub-task Curriculum
Figure 4 for Learning Complex Teamwork Tasks using a Sub-task Curriculum
Viaarxiv icon

Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers

Add code
Bookmark button
Alert button
Dec 22, 2022
Aleksandar Krnjaic, Jonathan D. Thomas, Georgios Papoudakis, Lukas Schäfer, Peter Börsting, Stefano V. Albrecht

Figure 1 for Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Figure 2 for Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Figure 3 for Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Figure 4 for Scalable Multi-Agent Reinforcement Learning for Warehouse Logistics with Robotic and Human Co-Workers
Viaarxiv icon

Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models

Add code
Bookmark button
Alert button
Oct 26, 2022
Filippos Christianos, Peter Karkus, Boris Ivanovic, Stefano V. Albrecht, Marco Pavone

Figure 1 for Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models
Figure 2 for Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models
Figure 3 for Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models
Figure 4 for Planning with Occluded Traffic Agents using Bi-Level Variational Occlusion Models
Viaarxiv icon

DiPA: Diverse and Probabilistically Accurate Interactive Prediction

Add code
Bookmark button
Alert button
Oct 12, 2022
Anthony Knittel, Majd Hawasly, Stefano V. Albrecht, John Redford, Subramanian Ramamoorthy

Figure 1 for DiPA: Diverse and Probabilistically Accurate Interactive Prediction
Figure 2 for DiPA: Diverse and Probabilistically Accurate Interactive Prediction
Figure 3 for DiPA: Diverse and Probabilistically Accurate Interactive Prediction
Figure 4 for DiPA: Diverse and Probabilistically Accurate Interactive Prediction
Viaarxiv icon

A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning

Add code
Bookmark button
Alert button
Oct 11, 2022
Arrasy Rahman, Ignacio Carlucho, Niklas Höpner, Stefano V. Albrecht

Figure 1 for A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
Figure 2 for A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
Figure 3 for A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
Figure 4 for A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning
Viaarxiv icon