Alert button
Picture for Gheorghe Comanici

Gheorghe Comanici

Alert button

Vision-Language Models as a Source of Rewards

Add code
Bookmark button
Alert button
Dec 14, 2023
Kate Baumli, Satinder Baveja, Feryal Behbahani, Harris Chan, Gheorghe Comanici, Sebastian Flennerhag, Maxime Gazeau, Kristian Holsheimer, Dan Horgan, Michael Laskin, Clare Lyle, Hussain Masoom, Kay McKinney, Volodymyr Mnih, Alexander Neitz, Fabio Pardo, Jack Parker-Holder, John Quan, Tim Rocktäschel, Himanshu Sahni, Tom Schaul, Yannick Schroecker, Stephen Spencer, Richie Steigerwald, Luyu Wang, Lei Zhang

Viaarxiv icon

Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search

Add code
Bookmark button
Alert button
Nov 06, 2023
Abbas Mehrabian, Ankit Anand, Hyunjik Kim, Nicolas Sonnerat, Matej Balog, Gheorghe Comanici, Tudor Berariu, Andrew Lee, Anian Ruoss, Anna Bulanova, Daniel Toyama, Sam Blackwell, Bernardino Romera Paredes, Petar Veličković, Laurent Orseau, Joonkyung Lee, Anurag Murty Naredla, Doina Precup, Adam Zsolt Wagner

Figure 1 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Figure 2 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Figure 3 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Figure 4 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Viaarxiv icon

Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 21, 2022
Gheorghe Comanici, Amelia Glaese, Anita Gergely, Daniel Toyama, Zafarali Ahmed, Tyler Jackson, Philippe Hamel, Doina Precup

Figure 1 for Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Figure 2 for Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Figure 3 for Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Figure 4 for Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Viaarxiv icon

Temporally Abstract Partial Models

Add code
Bookmark button
Alert button
Aug 06, 2021
Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici, Doina Precup

Figure 1 for Temporally Abstract Partial Models
Figure 2 for Temporally Abstract Partial Models
Figure 3 for Temporally Abstract Partial Models
Figure 4 for Temporally Abstract Partial Models
Viaarxiv icon

The Option Keyboard: Combining Skills in Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 24, 2021
André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan Hunt, Shibl Mourad, David Silver, Doina Precup

Figure 1 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 2 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 3 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 4 for The Option Keyboard: Combining Skills in Reinforcement Learning
Viaarxiv icon

AndroidEnv: A Reinforcement Learning Platform for Android

Add code
Bookmark button
Alert button
May 27, 2021
Daniel Toyama, Philippe Hamel, Anita Gergely, Gheorghe Comanici, Amelia Glaese, Zafarali Ahmed, Tyler Jackson, Shibl Mourad, Doina Precup

Figure 1 for AndroidEnv: A Reinforcement Learning Platform for Android
Figure 2 for AndroidEnv: A Reinforcement Learning Platform for Android
Figure 3 for AndroidEnv: A Reinforcement Learning Platform for Android
Figure 4 for AndroidEnv: A Reinforcement Learning Platform for Android
Viaarxiv icon

What can I do here? A Theory of Affordances in Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 26, 2020
Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici, David Abel, Doina Precup

Figure 1 for What can I do here? A Theory of Affordances in Reinforcement Learning
Figure 2 for What can I do here? A Theory of Affordances in Reinforcement Learning
Figure 3 for What can I do here? A Theory of Affordances in Reinforcement Learning
Figure 4 for What can I do here? A Theory of Affordances in Reinforcement Learning
Viaarxiv icon