Alert button
Picture for Andrei A. Rusu

Andrei A. Rusu

Alert button

DiPaCo: Distributed Path Composition

Add code
Bookmark button
Alert button
Mar 15, 2024
Arthur Douillard, Qixuan Feng, Andrei A. Rusu, Adhiguna Kuncoro, Yani Donchev, Rachita Chhaparia, Ionel Gog, Marc'Aurelio Ranzato, Jiajun Shen, Arthur Szlam

Figure 1 for DiPaCo: Distributed Path Composition
Figure 2 for DiPaCo: Distributed Path Composition
Figure 3 for DiPaCo: Distributed Path Composition
Figure 4 for DiPaCo: Distributed Path Composition
Viaarxiv icon

Asynchronous Local-SGD Training for Language Modeling

Add code
Bookmark button
Alert button
Jan 17, 2024
Bo Liu, Rachita Chhaparia, Arthur Douillard, Satyen Kale, Andrei A. Rusu, Jiajun Shen, Arthur Szlam, Marc'Aurelio Ranzato

Viaarxiv icon

DiLoCo: Distributed Low-Communication Training of Language Models

Add code
Bookmark button
Alert button
Nov 14, 2023
Arthur Douillard, Qixuan Feng, Andrei A. Rusu, Rachita Chhaparia, Yani Donchev, Adhiguna Kuncoro, Marc'Aurelio Ranzato, Arthur Szlam, Jiajun Shen

Viaarxiv icon

NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research

Add code
Bookmark button
Alert button
Nov 15, 2022
Jorg Bornschein, Alexandre Galashov, Ross Hemsley, Amal Rannen-Triki, Yutian Chen, Arslan Chaudhry, Xu Owen He, Arthur Douillard, Massimo Caccia, Qixuang Feng, Jiajun Shen, Sylvestre-Alvise Rebuffi, Kitty Stacpoole, Diego de las Casas, Will Hawkins, Angeliki Lazaridou, Yee Whye Teh, Andrei A. Rusu, Razvan Pascanu, Marc'Aurelio Ranzato

Figure 1 for NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
Figure 2 for NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
Figure 3 for NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
Figure 4 for NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
Viaarxiv icon

Continual Unsupervised Representation Learning

Add code
Bookmark button
Alert button
Oct 31, 2019
Dushyant Rao, Francesco Visin, Andrei A. Rusu, Yee Whye Teh, Razvan Pascanu, Raia Hadsell

Figure 1 for Continual Unsupervised Representation Learning
Figure 2 for Continual Unsupervised Representation Learning
Figure 3 for Continual Unsupervised Representation Learning
Figure 4 for Continual Unsupervised Representation Learning
Viaarxiv icon

Meta-Learning with Warped Gradient Descent

Add code
Bookmark button
Alert button
Aug 30, 2019
Sebastian Flennerhag, Andrei A. Rusu, Razvan Pascanu, Hujun Yin, Raia Hadsell

Figure 1 for Meta-Learning with Warped Gradient Descent
Figure 2 for Meta-Learning with Warped Gradient Descent
Figure 3 for Meta-Learning with Warped Gradient Descent
Figure 4 for Meta-Learning with Warped Gradient Descent
Viaarxiv icon

Task Agnostic Continual Learning via Meta Learning

Add code
Bookmark button
Alert button
Jun 12, 2019
Xu He, Jakub Sygnowski, Alexandre Galashov, Andrei A. Rusu, Yee Whye Teh, Razvan Pascanu

Figure 1 for Task Agnostic Continual Learning via Meta Learning
Figure 2 for Task Agnostic Continual Learning via Meta Learning
Figure 3 for Task Agnostic Continual Learning via Meta Learning
Figure 4 for Task Agnostic Continual Learning via Meta Learning
Viaarxiv icon

Meta-Learning with Latent Embedding Optimization

Add code
Bookmark button
Alert button
Sep 28, 2018
Andrei A. Rusu, Dushyant Rao, Jakub Sygnowski, Oriol Vinyals, Razvan Pascanu, Simon Osindero, Raia Hadsell

Figure 1 for Meta-Learning with Latent Embedding Optimization
Figure 2 for Meta-Learning with Latent Embedding Optimization
Figure 3 for Meta-Learning with Latent Embedding Optimization
Viaarxiv icon

Meta-Learning by the Baldwin Effect

Add code
Bookmark button
Alert button
Jun 22, 2018
Chrisantha Thomas Fernando, Jakub Sygnowski, Simon Osindero, Jane Wang, Tom Schaul, Denis Teplyashin, Pablo Sprechmann, Alexander Pritzel, Andrei A. Rusu

Figure 1 for Meta-Learning by the Baldwin Effect
Figure 2 for Meta-Learning by the Baldwin Effect
Figure 3 for Meta-Learning by the Baldwin Effect
Figure 4 for Meta-Learning by the Baldwin Effect
Viaarxiv icon

DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

Add code
Bookmark button
Alert button
Jun 06, 2018
Irina Higgins, Arka Pal, Andrei A. Rusu, Loic Matthey, Christopher P Burgess, Alexander Pritzel, Matthew Botvinick, Charles Blundell, Alexander Lerchner

Figure 1 for DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
Figure 2 for DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
Figure 3 for DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
Figure 4 for DARLA: Improving Zero-Shot Transfer in Reinforcement Learning
Viaarxiv icon