Martin Klissarov

Code as Reward: Empowering Reinforcement Learning with VLMs

Feb 07, 2024
David Venuto, Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand

Motif: Intrinsic Motivation from Artificial Intelligence Feedback

Sep 29, 2023
Martin Klissarov, Pierluca D'Oro, Shagun Sodhani, Roberta Raileanu, Pierre-Luc Bacon, Pascal Vincent, Amy Zhang, Mikael Henaff

Deep Laplacian-based Options for Temporally-Extended Exploration

Jan 26, 2023
Martin Klissarov, Marlos C. Machado

Flexible Option Learning

Dec 06, 2021
Martin Klissarov, Doina Precup

Reward Propagation Using Graph Convolutional Networks

Oct 06, 2020
Martin Klissarov, Doina Precup

Options of Interest: Temporal Abstraction with Interest Functions

Jan 01, 2020
Khimya Khetarpal, Martin Klissarov, Maxime Chevalier-Boisvert, Pierre-Luc Bacon, Doina Precup

Learnings Options End-to-End for Continuous Action Tasks

Nov 30, 2017
Martin Klissarov, Pierre-Luc Bacon, Jean Harb, Doina Precup

When Waiting is not an Option : Learning Options with a Deliberation Cost

Sep 14, 2017
Jean Harb, Pierre-Luc Bacon, Martin Klissarov, Doina Precup
