Alert button
Picture for Jakub Grudzien Kuba

Jakub Grudzien Kuba

Alert button

Functional Graphical Models: Structure Enables Offline Data-Driven Optimization

Add code
Bookmark button
Alert button
Jan 12, 2024
Jakub Grudzien Kuba, Masatoshi Uehara, Pieter Abbeel, Sergey Levine

Viaarxiv icon

IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies

Add code
Bookmark button
Alert button
Apr 20, 2023
Philippe Hansen-Estruch, Ilya Kostrikov, Michael Janner, Jakub Grudzien Kuba, Sergey Levine

Figure 1 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Figure 2 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Figure 3 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Figure 4 for IDQL: Implicit Q-Learning as an Actor-Critic Method with Diffusion Policies
Viaarxiv icon

Heterogeneous-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 19, 2023
Yifan Zhong, Jakub Grudzien Kuba, Siyi Hu, Jiaming Ji, Yaodong Yang

Figure 1 for Heterogeneous-Agent Reinforcement Learning
Figure 2 for Heterogeneous-Agent Reinforcement Learning
Figure 3 for Heterogeneous-Agent Reinforcement Learning
Figure 4 for Heterogeneous-Agent Reinforcement Learning
Viaarxiv icon

Discovered Policy Optimisation

Add code
Bookmark button
Alert button
Oct 13, 2022
Chris Lu, Jakub Grudzien Kuba, Alistair Letcher, Luke Metz, Christian Schroeder de Witt, Jakob Foerster

Figure 1 for Discovered Policy Optimisation
Figure 2 for Discovered Policy Optimisation
Figure 3 for Discovered Policy Optimisation
Figure 4 for Discovered Policy Optimisation
Viaarxiv icon

Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL

Add code
Bookmark button
Alert button
Aug 02, 2022
Jakub Grudzien Kuba, Xidong Feng, Shiyao Ding, Hao Dong, Jun Wang, Yaodong Yang

Figure 1 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 2 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 3 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Figure 4 for Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL
Viaarxiv icon

Multi-Agent Reinforcement Learning is a Sequence Modeling Problem

Add code
Bookmark button
Alert button
May 30, 2022
Muning Wen, Jakub Grudzien Kuba, Runji Lin, Weinan Zhang, Ying Wen, Jun Wang, Yaodong Yang

Figure 1 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Figure 2 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Figure 3 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Figure 4 for Multi-Agent Reinforcement Learning is a Sequence Modeling Problem
Viaarxiv icon

Understanding Value Decomposition Algorithms in Deep Cooperative Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Feb 16, 2022
Zehao Dou, Jakub Grudzien Kuba, Yaodong Yang

Viaarxiv icon

Mirror Learning: A Unifying Framework of Policy Optimisation

Add code
Bookmark button
Alert button
Feb 02, 2022
Jakub Grudzien Kuba, Christian Schroeder de Witt, Jakob Foerster

Figure 1 for Mirror Learning: A Unifying Framework of Policy Optimisation
Figure 2 for Mirror Learning: A Unifying Framework of Policy Optimisation
Figure 3 for Mirror Learning: A Unifying Framework of Policy Optimisation
Viaarxiv icon