Alert button
Picture for Florian Strub

Florian Strub

Alert button

Language Evolution with Deep Learning

Add code
Bookmark button
Alert button
Mar 18, 2024
Mathieu Rita, Paul Michel, Rahma Chaabouni, Olivier Pietquin, Emmanuel Dupoux, Florian Strub

Figure 1 for Language Evolution with Deep Learning
Figure 2 for Language Evolution with Deep Learning
Figure 3 for Language Evolution with Deep Learning
Figure 4 for Language Evolution with Deep Learning
Viaarxiv icon

Language Model Alignment with Elastic Reset

Add code
Bookmark button
Alert button
Dec 06, 2023
Michael Noukhovitch, Samuel Lavoie, Florian Strub, Aaron Courville

Viaarxiv icon

The Edge of Orthogonality: A Simple View of What Makes BYOL Tick

Add code
Bookmark button
Alert button
Feb 09, 2023
Pierre H. Richemond, Allison Tam, Yunhao Tang, Florian Strub, Bilal Piot, Felix Hill

Figure 1 for The Edge of Orthogonality: A Simple View of What Makes BYOL Tick
Figure 2 for The Edge of Orthogonality: A Simple View of What Makes BYOL Tick
Figure 3 for The Edge of Orthogonality: A Simple View of What Makes BYOL Tick
Figure 4 for The Edge of Orthogonality: A Simple View of What Makes BYOL Tick
Viaarxiv icon

SemPPL: Predicting pseudo-labels for better contrastive representations

Add code
Bookmark button
Alert button
Jan 12, 2023
Matko Bošnjak, Pierre H. Richemond, Nenad Tomasev, Florian Strub, Jacob C. Walker, Felix Hill, Lars Holger Buesing, Razvan Pascanu, Charles Blundell, Jovana Mitrovic

Figure 1 for SemPPL: Predicting pseudo-labels for better contrastive representations
Figure 2 for SemPPL: Predicting pseudo-labels for better contrastive representations
Figure 3 for SemPPL: Predicting pseudo-labels for better contrastive representations
Figure 4 for SemPPL: Predicting pseudo-labels for better contrastive representations
Viaarxiv icon

Over-communicate no more: Situated RL agents learn concise communication protocols

Add code
Bookmark button
Alert button
Nov 02, 2022
Aleksandra Kalinowska, Elnaz Davoodi, Florian Strub, Kory W Mathewson, Ivana Kajic, Michael Bowling, Todd D Murphey, Patrick M Pilarski

Figure 1 for Over-communicate no more: Situated RL agents learn concise communication protocols
Figure 2 for Over-communicate no more: Situated RL agents learn concise communication protocols
Figure 3 for Over-communicate no more: Situated RL agents learn concise communication protocols
Figure 4 for Over-communicate no more: Situated RL agents learn concise communication protocols
Viaarxiv icon

Emergent Communication: Generalization and Overfitting in Lewis Games

Add code
Bookmark button
Alert button
Sep 30, 2022
Mathieu Rita, Corentin Tallec, Paul Michel, Jean-Bastien Grill, Olivier Pietquin, Emmanuel Dupoux, Florian Strub

Figure 1 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 2 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 3 for Emergent Communication: Generalization and Overfitting in Lewis Games
Figure 4 for Emergent Communication: Generalization and Overfitting in Lewis Games
Viaarxiv icon

Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments

Add code
Bookmark button
Alert button
Sep 22, 2022
Ian Gemp, Thomas Anthony, Yoram Bachrach, Avishkar Bhoopchand, Kalesha Bullard, Jerome Connor, Vibhavari Dasagi, Bart De Vylder, Edgar Duenez-Guzman, Romuald Elie, Richard Everett, Daniel Hennes, Edward Hughes, Mina Khan, Marc Lanctot, Kate Larson, Guy Lever, Siqi Liu, Luke Marris, Kevin R. McKee, Paul Muller, Julien Perolat, Florian Strub, Andrea Tacchetti, Eugene Tarassov, Zhe Wang, Karl Tuyls

Viaarxiv icon

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 30, 2022
Julien Perolat, Bart de Vylder, Daniel Hennes, Eugene Tarassov, Florian Strub, Vincent de Boer, Paul Muller, Jerome T. Connor, Neil Burch, Thomas Anthony, Stephen McAleer, Romuald Elie, Sarah H. Cen, Zhe Wang, Audrunas Gruslys, Aleksandra Malysheva, Mina Khan, Sherjil Ozair, Finbarr Timbers, Toby Pohlen, Tom Eccles, Mark Rowland, Marc Lanctot, Jean-Baptiste Lespiau, Bilal Piot, Shayegan Omidshafiei, Edward Lockhart, Laurent Sifre, Nathalie Beauguerlange, Remi Munos, David Silver, Satinder Singh, Demis Hassabis, Karl Tuyls

Figure 1 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 2 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 3 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Figure 4 for Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning
Viaarxiv icon

Learning Natural Language Generation from Scratch

Add code
Bookmark button
Alert button
Sep 20, 2021
Alice Martin Donati, Guillaume Quispe, Charles Ollion, Sylvain Le Corff, Florian Strub, Olivier Pietquin

Figure 1 for Learning Natural Language Generation from Scratch
Figure 2 for Learning Natural Language Generation from Scratch
Figure 3 for Learning Natural Language Generation from Scratch
Figure 4 for Learning Natural Language Generation from Scratch
Viaarxiv icon

Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness

Add code
Bookmark button
Alert button
May 31, 2021
Mathieu Seurin, Florian Strub, Philippe Preux, Olivier Pietquin

Figure 1 for Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness
Figure 2 for Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness
Figure 3 for Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness
Figure 4 for Don't Do What Doesn't Matter: Intrinsic Motivation with Action Usefulness
Viaarxiv icon