Alert button
Picture for Michael Noukhovitch

Michael Noukhovitch

Alert button

The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization

Add code
Bookmark button
Alert button
Mar 24, 2024
Shengyi Huang, Michael Noukhovitch, Arian Hosseini, Kashif Rasul, Weixun Wang, Lewis Tunstall

Viaarxiv icon

Language Model Alignment with Elastic Reset

Add code
Bookmark button
Alert button
Dec 06, 2023
Michael Noukhovitch, Samuel Lavoie, Florian Strub, Aaron Courville

Viaarxiv icon

Learning to Communicate using Contrastive Learning

Add code
Bookmark button
Alert button
Jul 03, 2023
Yat Long Lo, Biswa Sengupta, Jakob Foerster, Michael Noukhovitch

Figure 1 for Learning to Communicate using Contrastive Learning
Figure 2 for Learning to Communicate using Contrastive Learning
Figure 3 for Learning to Communicate using Contrastive Learning
Figure 4 for Learning to Communicate using Contrastive Learning
Viaarxiv icon

Pretraining Representations for Data-Efficient Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 09, 2021
Max Schwarzer, Nitarshan Rajkumar, Michael Noukhovitch, Ankesh Anand, Laurent Charlin, Devon Hjelm, Philip Bachman, Aaron Courville

Figure 1 for Pretraining Representations for Data-Efficient Reinforcement Learning
Figure 2 for Pretraining Representations for Data-Efficient Reinforcement Learning
Figure 3 for Pretraining Representations for Data-Efficient Reinforcement Learning
Figure 4 for Pretraining Representations for Data-Efficient Reinforcement Learning
Viaarxiv icon

Emergent Communication under Competition

Add code
Bookmark button
Alert button
Jan 25, 2021
Michael Noukhovitch, Travis LaCroix, Angeliki Lazaridou, Aaron Courville

Figure 1 for Emergent Communication under Competition
Figure 2 for Emergent Communication under Competition
Figure 3 for Emergent Communication under Competition
Figure 4 for Emergent Communication under Competition
Viaarxiv icon

Systematic Generalization: What Is Required and Can It Be Learned?

Add code
Bookmark button
Alert button
Nov 30, 2018
Dzmitry Bahdanau, Shikhar Murty, Michael Noukhovitch, Thien Huu Nguyen, Harm de Vries, Aaron Courville

Figure 1 for Systematic Generalization: What Is Required and Can It Be Learned?
Figure 2 for Systematic Generalization: What Is Required and Can It Be Learned?
Figure 3 for Systematic Generalization: What Is Required and Can It Be Learned?
Figure 4 for Systematic Generalization: What Is Required and Can It Be Learned?
Viaarxiv icon

Commonsense mining as knowledge base completion? A study on the impact of novelty

Add code
Bookmark button
Alert button
Apr 24, 2018
Stanisław Jastrzębski, Dzmitry Bahdanau, Seyedarian Hosseini, Michael Noukhovitch, Yoshua Bengio, Jackie Chi Kit Cheung

Figure 1 for Commonsense mining as knowledge base completion? A study on the impact of novelty
Figure 2 for Commonsense mining as knowledge base completion? A study on the impact of novelty
Figure 3 for Commonsense mining as knowledge base completion? A study on the impact of novelty
Figure 4 for Commonsense mining as knowledge base completion? A study on the impact of novelty
Viaarxiv icon