Picture for Doina Precup

Doina Precup

McGill University, Mila- Quebec Artificial Intelligence Institute

DGFN: Double Generative Flow Networks

Add code
Nov 06, 2023
Figure 1 for DGFN: Double Generative Flow Networks
Figure 2 for DGFN: Double Generative Flow Networks
Figure 3 for DGFN: Double Generative Flow Networks
Figure 4 for DGFN: Double Generative Flow Networks
Viaarxiv icon

Conditions on Preference Relations that Guarantee the Existence of Optimal Policies

Add code
Nov 03, 2023
Viaarxiv icon

Forecaster: Towards Temporally Abstract Tree-Search Planning from Pixels

Add code
Oct 16, 2023
Viaarxiv icon

A cry for help: Early detection of brain injury in newborns

Add code
Oct 13, 2023
Figure 1 for A cry for help: Early detection of brain injury in newborns
Figure 2 for A cry for help: Early detection of brain injury in newborns
Figure 3 for A cry for help: Early detection of brain injury in newborns
Figure 4 for A cry for help: Early detection of brain injury in newborns
Viaarxiv icon

Combining Spatial and Temporal Abstraction in Planning for Better Generalization

Add code
Sep 30, 2023
Figure 1 for Combining Spatial and Temporal Abstraction in Planning for Better Generalization
Figure 2 for Combining Spatial and Temporal Abstraction in Planning for Better Generalization
Figure 3 for Combining Spatial and Temporal Abstraction in Planning for Better Generalization
Figure 4 for Combining Spatial and Temporal Abstraction in Planning for Better Generalization
Viaarxiv icon

Policy composition in reinforcement learning via multi-objective policy optimization

Add code
Aug 30, 2023
Viaarxiv icon

A Definition of Continual Reinforcement Learning

Add code
Jul 20, 2023
Viaarxiv icon

On the Convergence of Bounded Agents

Add code
Jul 20, 2023
Viaarxiv icon

An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Add code
Jul 18, 2023
Figure 1 for An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets
Figure 2 for An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets
Figure 3 for An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets
Figure 4 for An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets
Viaarxiv icon

Optimism and Adaptivity in Policy Optimization

Add code
Jun 18, 2023
Viaarxiv icon