Picture for Jakub Grudzien Kuba

Jakub Grudzien Kuba

Functional Graphical Models: Structure Enables Offline Data-Driven Optimization

Add code
Jan 12, 2024
Viaarxiv icon

IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies

Add code
Apr 20, 2023
Viaarxiv icon

Heterogeneous-Agent Reinforcement Learning

Add code
Apr 19, 2023
Viaarxiv icon

Discovered Policy Optimisation

Add code
Oct 13, 2022
Figure 1 for Discovered Policy Optimisation
Figure 2 for Discovered Policy Optimisation
Figure 3 for Discovered Policy Optimisation
Figure 4 for Discovered Policy Optimisation
Viaarxiv icon

Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL

Add code
Aug 02, 2022
Figure 1 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 2 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 3 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 4 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Viaarxiv icon

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem

Add code
May 30, 2022
Figure 1 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Figure 2 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Figure 3 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Figure 4 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Viaarxiv icon

Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning

Add code
Feb 16, 2022
Viaarxiv icon

Mirror Learning: A Unifying Framework of Policy Optimisation

Add code
Feb 02, 2022
Figure 1 for Mirror Learning: A Unifying Framework of Policy Optimisation
Figure 2 for Mirror Learning: A Unifying Framework of Policy Optimisation
Figure 3 for Mirror Learning: A Unifying Framework of Policy Optimisation
Viaarxiv icon

Multi-Agent Constrained Policy Optimisation

Add code
Oct 06, 2021
Figure 1 for Multi-Agent Constrained Policy Optimisation
Figure 2 for Multi-Agent Constrained Policy Optimisation
Figure 3 for Multi-Agent Constrained Policy Optimisation
Figure 4 for Multi-Agent Constrained Policy Optimisation
Viaarxiv icon

Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning

Add code
Sep 23, 2021
Figure 1 for Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Figure 2 for Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Figure 3 for Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Figure 4 for Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning
Viaarxiv icon