
Doina Precup


On the Convergence of Bounded Agents

Jul 20, 2023
David Abel, André Barreto, Hado van Hasselt, Benjamin Van Roy, Doina Precup, Satinder Singh

Via arXiv

An Empirical Study of the Effectiveness of Using a Replay Buffer on Mode Discovery in GFlowNets

Jul 18, 2023
Nikhil Vemgal, Elaine Lau, Doina Precup

Via arXiv

Optimism and Adaptivity in Policy Optimization

Jun 18, 2023
Veronica Chelu, Tom Zahavy, Arthur Guez, Doina Precup, Sebastian Flennerhag

Via arXiv

For SALE: State-Action Representation Learning for Deep Reinforcement Learning

Jun 04, 2023
Scott Fujimoto, Wei-Di Chang, Edward J. Smith, Shixiang Shane Gu, Doina Precup, David Meger

Via arXiv

Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo

May 29, 2023
Haque Ishfaq, Qingfeng Lan, Pan Xu, A. Rupam Mahmood, Doina Precup, Anima Anandkumar, Kamyar Azizzadenesheli

Via arXiv

Policy Gradient Methods in the Presence of Symmetries and State Abstractions

May 09, 2023
Prakash Panangaden, Sahand Rezaei-Shoshtari, Rosie Zhao, David Meger, Doina Precup

Via arXiv

MUDiff: Unified Diffusion for Complete Molecule Generation

Apr 28, 2023
Chenqing Hua, Sitao Luan, Minkai Xu, Rex Ying, Jie Fu, Stefano Ermon, Doina Precup

Via arXiv

When Do Graph Neural Networks Help with Node Classification: Investigating the Homophily Principle on Node Distinguishability

Apr 25, 2023
Sitao Luan, Chenqing Hua, Minkai Xu, Qincheng Lu, Jiaqi Zhu, Xiao-Wen Chang, Jie Fu, Jure Leskovec, Doina Precup

Via arXiv

Accelerating exploration and representation learning with offline pre-training

Mar 31, 2023
Bogdan Mazoure, Jake Bruce, Doina Precup, Rob Fergus, Ankit Anand

Via arXiv

The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation

Feb 14, 2023
Kushal Arora, Timothy J. O'Donnell, Doina Precup, Jason Weston, Jackie C. K. Cheung

Via arXiv