Picture for Doina Precup

Doina Precup

McGill University, Mila- Quebec Artificial Intelligence Institute

SCAR: Shapley Credit Assignment for More Efficient RLHF

Add code
May 26, 2025
Viaarxiv icon

Uncovering a Universal Abstract Algorithm for Modular Addition in Neural Networks

Add code
May 23, 2025
Viaarxiv icon

Plasticity as the Mirror of Empowerment

Add code
May 15, 2025
Viaarxiv icon

Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?

Add code
May 14, 2025
Viaarxiv icon

Cracking the Code of Action: a Generative Approach to Affordances for Reinforcement Learning

Add code
Apr 24, 2025
Viaarxiv icon

Capturing Individual Human Preferences with Reward Features

Add code
Mar 21, 2025
Viaarxiv icon

Agency Is Frame-Dependent

Add code
Feb 06, 2025
Figure 1 for Agency Is Frame-Dependent
Viaarxiv icon

Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning

Add code
Jan 29, 2025
Figure 1 for Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Figure 2 for Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Figure 3 for Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Figure 4 for Langevin Soft Actor-Critic: Efficient Exploration through Uncertainty-Driven Critic Learning
Viaarxiv icon

Fairness in Reinforcement Learning with Bisimulation Metrics

Add code
Dec 22, 2024
Figure 1 for Fairness in Reinforcement Learning with Bisimulation Metrics
Figure 2 for Fairness in Reinforcement Learning with Bisimulation Metrics
Figure 3 for Fairness in Reinforcement Learning with Bisimulation Metrics
Figure 4 for Fairness in Reinforcement Learning with Bisimulation Metrics
Viaarxiv icon

MaestroMotif: Skill Design from Artificial Intelligence Feedback

Add code
Dec 11, 2024
Figure 1 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Figure 2 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Figure 3 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Figure 4 for MaestroMotif: Skill Design from Artificial Intelligence Feedback
Viaarxiv icon