Picture for Doina Precup

Doina Precup

McGill University, Mila- Quebec Artificial Intelligence Institute

For SALE: State-Action Representation Learning for Deep Reinforcement Learning

Add code
Jun 04, 2023
Figure 1 for For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Figure 2 for For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Figure 3 for For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Figure 4 for For SALE: State-Action Representation Learning for Deep Reinforcement Learning
Viaarxiv icon

Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo

Add code
May 29, 2023
Figure 1 for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Figure 2 for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Figure 3 for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Figure 4 for Provable and Practical: Efficient Exploration in Reinforcement Learning via Langevin Monte Carlo
Viaarxiv icon

Policy Gradient Methods in the Presence of Symmetries and State Abstractions

Add code
May 09, 2023
Figure 1 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 2 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 3 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Figure 4 for Policy Gradient Methods in the Presence of Symmetries and State Abstractions
Viaarxiv icon

MUDiff: Unified Diffusion for Complete Molecule Generation

Add code
Apr 28, 2023
Viaarxiv icon

When Do Graph Neural Networks Help with Node Classification: Investigating the Homophily Principle on Node Distinguishability

Add code
Apr 25, 2023
Figure 1 for When Do Graph Neural Networks Help with Node Classification: Investigating the Homophily Principle on Node Distinguishability
Figure 2 for When Do Graph Neural Networks Help with Node Classification: Investigating the Homophily Principle on Node Distinguishability
Figure 3 for When Do Graph Neural Networks Help with Node Classification: Investigating the Homophily Principle on Node Distinguishability
Figure 4 for When Do Graph Neural Networks Help with Node Classification: Investigating the Homophily Principle on Node Distinguishability
Viaarxiv icon

Accelerating exploration and representation learning with offline pre-training

Add code
Mar 31, 2023
Figure 1 for Accelerating exploration and representation learning with offline pre-training
Figure 2 for Accelerating exploration and representation learning with offline pre-training
Figure 3 for Accelerating exploration and representation learning with offline pre-training
Figure 4 for Accelerating exploration and representation learning with offline pre-training
Viaarxiv icon

The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation

Add code
Feb 14, 2023
Figure 1 for The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation
Figure 2 for The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation
Figure 3 for The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation
Figure 4 for The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation
Viaarxiv icon

Minimal Value-Equivalent Partial Models for Scalable and Robust Planning in Lifelong Reinforcement Learning

Add code
Jan 24, 2023
Viaarxiv icon

On the Challenges of using Reinforcement Learning in Precision Drug Dosing: Delay and Prolongedness of Action Effects

Add code
Jan 02, 2023
Viaarxiv icon

Offline Policy Optimization in RL with Variance Regularizaton

Add code
Dec 29, 2022
Figure 1 for Offline Policy Optimization in RL with Variance Regularizaton
Figure 2 for Offline Policy Optimization in RL with Variance Regularizaton
Figure 3 for Offline Policy Optimization in RL with Variance Regularizaton
Figure 4 for Offline Policy Optimization in RL with Variance Regularizaton
Viaarxiv icon