Alert button
Picture for Doina Precup

Doina Precup

Alert button

Offline Multitask Representation Learning for Reinforcement Learning

Add code
Bookmark button
Alert button
Mar 18, 2024
Haque Ishfaq, Thanh Nguyen-Tang, Songtao Feng, Raman Arora, Mengdi Wang, Ming Yin, Doina Precup

Figure 1 for Offline Multitask Representation Learning for Reinforcement Learning
Viaarxiv icon

Discrete Probabilistic Inference as Control in Multi-path Environments

Add code
Bookmark button
Alert button
Feb 15, 2024
Tristan Deleu, Padideh Nouri, Nikolay Malkin, Doina Precup, Yoshua Bengio

Viaarxiv icon

Mixtures of Experts Unlock Parameter Scaling for Deep RL

Add code
Bookmark button
Alert button
Feb 13, 2024
Johan Obando-Ceron, Ghada Sokar, Timon Willi, Clare Lyle, Jesse Farebrother, Jakob Foerster, Gintare Karolina Dziugaite, Doina Precup, Pablo Samuel Castro

Viaarxiv icon

On the Privacy of Selection Mechanisms with Gaussian Noise

Add code
Bookmark button
Alert button
Feb 09, 2024
Jonathan Lebensold, Doina Precup, Borja Balle

Viaarxiv icon

QGFN: Controllable Greediness with Action Values

Add code
Bookmark button
Alert button
Feb 07, 2024
Elaine Lau, Stephen Zhewen Lu, Ling Pan, Doina Precup, Emmanuel Bengio

Viaarxiv icon

Code as Reward: Empowering Reinforcement Learning with VLMs

Add code
Bookmark button
Alert button
Feb 07, 2024
David Venuto, Sami Nur Islam, Martin Klissarov, Doina Precup, Sherry Yang, Ankit Anand

Viaarxiv icon

Effective Protein-Protein Interaction Exploration with PPIretrieval

Add code
Bookmark button
Alert button
Feb 06, 2024
Chenqing Hua, Connor Coley, Guy Wolf, Doina Precup, Shuangjia Zheng

Viaarxiv icon

Prediction and Control in Continual Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 18, 2023
Nishanth Anand, Doina Precup

Viaarxiv icon

Nash Learning from Human Feedback

Add code
Bookmark button
Alert button
Dec 06, 2023
Rémi Munos, Michal Valko, Daniele Calandriello, Mohammad Gheshlaghi Azar, Mark Rowland, Zhaohan Daniel Guo, Yunhao Tang, Matthieu Geist, Thomas Mesnard, Andrea Michi, Marco Selvi, Sertan Girgin, Nikola Momchev, Olivier Bachem, Daniel J. Mankowitz, Doina Precup, Bilal Piot

Figure 1 for Nash Learning from Human Feedback
Figure 2 for Nash Learning from Human Feedback
Figure 3 for Nash Learning from Human Feedback
Figure 4 for Nash Learning from Human Feedback
Viaarxiv icon

Learning domain-invariant classifiers for infant cry sounds

Add code
Bookmark button
Alert button
Nov 30, 2023
Charles C. Onu, Hemanth K. Sheetha, Arsenii Gorin, Doina Precup

Viaarxiv icon