Picture for Gheorghe Comanici

Gheorghe Comanici

Using Reward Uncertainty to Induce Diverse Behaviour in Reinforcement Learning

Add code
Jun 02, 2026
Viaarxiv icon

Affordances Enable Partial World Modeling with LLMs

Add code
Feb 11, 2026
Viaarxiv icon

Vision-Language Models as a Source of Rewards

Add code
Dec 14, 2023
Figure 1 for Vision-Language Models as a Source of Rewards
Figure 2 for Vision-Language Models as a Source of Rewards
Figure 3 for Vision-Language Models as a Source of Rewards
Figure 4 for Vision-Language Models as a Source of Rewards
Viaarxiv icon

Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search

Add code
Nov 06, 2023
Figure 1 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Figure 2 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Figure 3 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Figure 4 for Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search
Viaarxiv icon

Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning

Add code
Apr 21, 2022
Figure 1 for Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Figure 2 for Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Figure 3 for Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Figure 4 for Learning how to Interact with a Complex Interface using Hierarchical Reinforcement Learning
Viaarxiv icon

Temporally Abstract Partial Models

Add code
Aug 06, 2021
Figure 1 for Temporally Abstract Partial Models
Figure 2 for Temporally Abstract Partial Models
Figure 3 for Temporally Abstract Partial Models
Figure 4 for Temporally Abstract Partial Models
Viaarxiv icon

The Option Keyboard: Combining Skills in Reinforcement Learning

Add code
Jun 24, 2021
Figure 1 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 2 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 3 for The Option Keyboard: Combining Skills in Reinforcement Learning
Figure 4 for The Option Keyboard: Combining Skills in Reinforcement Learning
Viaarxiv icon

AndroidEnv: A Reinforcement Learning Platform for Android

Add code
May 27, 2021
Figure 1 for AndroidEnv: A Reinforcement Learning Platform for Android
Figure 2 for AndroidEnv: A Reinforcement Learning Platform for Android
Figure 3 for AndroidEnv: A Reinforcement Learning Platform for Android
Figure 4 for AndroidEnv: A Reinforcement Learning Platform for Android
Viaarxiv icon

What can I do here? A Theory of Affordances in Reinforcement Learning

Add code
Jun 26, 2020
Figure 1 for What can I do here? A Theory of Affordances in Reinforcement Learning
Figure 2 for What can I do here? A Theory of Affordances in Reinforcement Learning
Figure 3 for What can I do here? A Theory of Affordances in Reinforcement Learning
Figure 4 for What can I do here? A Theory of Affordances in Reinforcement Learning
Viaarxiv icon