Vinicius Zambaldi

The Advantage Regret-Matching Actor-Critic

Aug 27, 2020
Audrūnas Gruslys, Marc Lanctot, Rémi Munos, Finbarr Timbers, Martin Schmid, Julien Perolat, Dustin Morrill, Vinicius Zambaldi, Jean-Baptiste Lespiau, John Schultz, Mohammad Gheshlaghi Azar, Michael Bowling, Karl Tuyls

MEMO: A Deep Network for Flexible Combination of Episodic Memories

Jan 29, 2020
Andrea Banino, Adrià Puigdomènech Badia, Raphael Köster, Martin J. Chadwick, Vinicius Zambaldi, Demis Hassabis, Caswell Barry, Matthew Botvinick, Dharshan Kumaran, Charles Blundell

OpenSpiel: A Framework for Reinforcement Learning in Games

Oct 10, 2019
Marc Lanctot, Edward Lockhart, Jean-Baptiste Lespiau, Vinicius Zambaldi, Satyaki Upadhyay, Julien Pérolat, Sriram Srinivasan, Finbarr Timbers, Karl Tuyls, Shayegan Omidshafiei, Daniel Hennes, Dustin Morrill, Paul Muller, Timo Ewalds, Ryan Faulkner, János Kramár, Bart De Vylder, Brennan Saeta, James Bradbury, David Ding, Sebastian Borgeaud, Matthew Lai, Julian Schrittwieser, Thomas Anthony, Edward Hughes, Ivo Danihelka, Jonah Ryan-Davis

Compositional Imitation Learning: Explaining and executing one task at a time

Dec 04, 2018
Thomas Kipf, Yujia Li, Hanjun Dai, Vinicius Zambaldi, Edward Grefenstette, Pushmeet Kohli, Peter Battaglia

Actor-Critic Policy Optimization in Partially Observable Multiagent Environments

Oct 21, 2018
Sriram Srinivasan, Marc Lanctot, Vinicius Zambaldi, Julien Perolat, Karl Tuyls, Remi Munos, Michael Bowling

Relational inductive biases, deep learning, and graph networks

Oct 17, 2018
Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani, Kelsey Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matt Botvinick, Oriol Vinyals, Yujia Li, Razvan Pascanu

Relational Forward Models for Multi-Agent Learning

Sep 28, 2018
Andrea Tacchetti, H. Francis Song, Pedro A. M. Mediano, Vinicius Zambaldi, Neil C. Rabinowitz, Thore Graepel, Matthew Botvinick, Peter W. Battaglia

Relational Deep Reinforcement Learning

Jun 28, 2018
Vinicius Zambaldi, David Raposo, Adam Santoro, Victor Bapst, Yujia Li, Igor Babuschkin, Karl Tuyls, David Reichert, Timothy Lillicrap, Edward Lockhart, Murray Shanahan, Victoria Langston, Razvan Pascanu, Matthew Botvinick, Oriol Vinyals, Peter Battaglia
