Max Schwarzer

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Mar 22, 2024
Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Guoli Yin, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang

Via arXiv

Learning and Controlling Silicon Dopant Transitions in Graphene using Scanning Transmission Electron Microscopy

Nov 21, 2023
Max Schwarzer, Jesse Farebrother, Joshua Greaves, Ekin Dogus Cubuk, Rishabh Agarwal, Aaron Courville, Marc G. Bellemare, Sergei Kalinin, Igor Mordatch, Pablo Samuel Castro, Kevin M. Roccapriore

Via arXiv

Large Language Models as Generalizable Policies for Embodied Tasks

Oct 26, 2023
Andrew Szot, Max Schwarzer, Harsh Agrawal, Bogdan Mazoure, Walter Talbott, Katherine Metcalf, Natalie Mackraz, Devon Hjelm, Alexander Toshev

Via arXiv

Bigger, Better, Faster: Human-level Atari with human-level efficiency

Jun 09, 2023
Max Schwarzer, Johan Obando-Ceron, Aaron Courville, Marc Bellemare, Rishabh Agarwal, Pablo Samuel Castro

Via arXiv

Beyond Tabula Rasa: Reincarnating Reinforcement Learning

Jun 03, 2022
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron Courville, Marc G. Bellemare

Via arXiv

The Primacy Bias in Deep Reinforcement Learning

May 16, 2022
Evgenii Nikishin, Max Schwarzer, Pierluca D'Oro, Pierre-Luc Bacon, Aaron Courville

Via arXiv

Simplicial Embeddings in Self-Supervised Learning and Downstream Classification

Apr 01, 2022
Samuel Lavoie, Christos Tsirigotis, Max Schwarzer, Kenji Kawaguchi, Ankit Vani, Aaron Courville

Via arXiv

Deep Reinforcement Learning at the Edge of the Statistical Precipice

Aug 30, 2021
Rishabh Agarwal, Max Schwarzer, Pablo Samuel Castro, Aaron Courville, Marc G. Bellemare

Via arXiv