Doina Precup

McGill University, Mila - Quebec Artificial Intelligence Institute

Discrete Probabilistic Inference as Control in Multi-path Environments

Feb 15, 2024
Via arXiv

Mixtures of Experts Unlock Parameter Scaling for Deep RL

Feb 13, 2024
Via arXiv

On the Privacy of Selection Mechanisms with Gaussian Noise

Feb 09, 2024
Via arXiv

QGFN: Controllable Greediness with Action Values

Feb 07, 2024
Via arXiv

Code as Reward: Empowering Reinforcement Learning with VLMs

Feb 07, 2024
Via arXiv

Effective Protein-Protein Interaction Exploration with PPIretrieval

Feb 06, 2024
Via arXiv

Prediction and Control in Continual Reinforcement Learning

Dec 18, 2023
Via arXiv

Nash Learning from Human Feedback

Dec 06, 2023
Via arXiv

Learning domain-invariant classifiers for infant cry sounds

Nov 30, 2023
Via arXiv

Finding Increasingly Large Extremal Graphs with AlphaZero and Tabu Search

Nov 06, 2023
Via arXiv