Picture for Aaron Courville

Aaron Courville

Universite de Montreal

Managing multiple agents by automatically adjusting incentives

Add code
Sep 03, 2024
Viaarxiv icon

SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning

Add code
Jul 03, 2024
Figure 1 for SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Figure 2 for SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Figure 3 for SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Figure 4 for SAFT: Towards Out-of-Distribution Generalization in Fine-Tuning
Viaarxiv icon

Multimodal foundation world models for generalist embodied agents

Add code
Jun 26, 2024
Viaarxiv icon

On the consistency of hyper-parameter selection in value-based deep reinforcement learning

Add code
Jun 25, 2024
Figure 1 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Figure 2 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Figure 3 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Figure 4 for On the consistency of hyper-parameter selection in value-based deep reinforcement learning
Viaarxiv icon

Advantage Alignment Algorithms

Add code
Jun 20, 2024
Figure 1 for Advantage Alignment Algorithms
Figure 2 for Advantage Alignment Algorithms
Figure 3 for Advantage Alignment Algorithms
Figure 4 for Advantage Alignment Algorithms
Viaarxiv icon

The Curse of Diversity in Ensemble-Based Exploration

Add code
May 07, 2024
Figure 1 for The Curse of Diversity in Ensemble-Based Exploration
Figure 2 for The Curse of Diversity in Ensemble-Based Exploration
Figure 3 for The Curse of Diversity in Ensemble-Based Exploration
Figure 4 for The Curse of Diversity in Ensemble-Based Exploration
Viaarxiv icon

LOQA: Learning with Opponent Q-Learning Awareness

Add code
May 02, 2024
Figure 1 for LOQA: Learning with Opponent Q-Learning Awareness
Figure 2 for LOQA: Learning with Opponent Q-Learning Awareness
Figure 3 for LOQA: Learning with Opponent Q-Learning Awareness
Figure 4 for LOQA: Learning with Opponent Q-Learning Awareness
Viaarxiv icon

Modeling Caption Diversity in Contrastive Vision-Language Pretraining

Add code
Apr 30, 2024
Figure 1 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 2 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 3 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Figure 4 for Modeling Caption Diversity in Contrastive Vision-Language Pretraining
Viaarxiv icon

SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision

Add code
Apr 24, 2024
Figure 1 for SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision
Figure 2 for SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision
Figure 3 for SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision
Figure 4 for SPARO: Selective Attention for Robust and Compositional Transformer Encodings for Vision
Viaarxiv icon

Best Response Shaping

Add code
Apr 05, 2024
Figure 1 for Best Response Shaping
Figure 2 for Best Response Shaping
Figure 3 for Best Response Shaping
Figure 4 for Best Response Shaping
Viaarxiv icon