Marc'Aurelio Ranzato

DiPaCo: Distributed Path Composition

Mar 15, 2024
Arthur Douillard, Qixuan Feng, Andrei A. Rusu, Adhiguna Kuncoro, Yani Donchev, Rachita Chhaparia, Ionel Gog, Marc'Aurelio Ranzato, Jiajun Shen, Arthur Szlam

Asynchronous Local-SGD Training for Language Modeling

Jan 17, 2024
Bo Liu, Rachita Chhaparia, Arthur Douillard, Satyen Kale, Andrei A. Rusu, Jiajun Shen, Arthur Szlam, Marc'Aurelio Ranzato

DiLoCo: Distributed Low-Communication Training of Language Models

Nov 14, 2023
Arthur Douillard, Qixuan Feng, Andrei A. Rusu, Rachita Chhaparia, Yani Donchev, Adhiguna Kuncoro, Marc'Aurelio Ranzato, Arthur Szlam, Jiajun Shen

Towards Robust and Efficient Continual Language Learning

Jul 11, 2023
Adam Fisch, Amal Rannen-Triki, Razvan Pascanu, Jörg Bornschein, Angeliki Lazaridou, Elena Gribovskaya, Marc'Aurelio Ranzato

Towards Compute-Optimal Transfer Learning

Apr 25, 2023
Massimo Caccia, Alexandre Galashov, Arthur Douillard, Amal Rannen-Triki, Dushyant Rao, Michela Paganini, Laurent Charlin, Marc'Aurelio Ranzato, Razvan Pascanu

NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research

Nov 15, 2022
Jörg Bornschein, Alexandre Galashov, Ross Hemsley, Amal Rannen-Triki, Yutian Chen, Arslan Chaudhry, Xu Owen He, Arthur Douillard, Massimo Caccia, Qixuan Feng, Jiajun Shen, Sylvestre-Alvise Rebuffi, Kitty Stacpoole, Diego de las Casas, Will Hawkins, Angeliki Lazaridou, Yee Whye Teh, Andrei A. Rusu, Razvan Pascanu, Marc'Aurelio Ranzato

Multi-step Planning for Automated Hyperparameter Optimization with OptFormer

Oct 10, 2022
Lucio M. Dery, Abram L. Friesen, Nando De Freitas, Marc'Aurelio Ranzato, Yutian Chen

On Anytime Learning at Macroscale

Jun 17, 2021
Lucas Caccia, Jing Xu, Myle Ott, Marc'Aurelio Ranzato, Ludovic Denoyer

The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

Jun 06, 2021
Naman Goyal, Cynthia Gao, Vishrav Chaudhary, Peng-Jen Chen, Guillaume Wenzek, Da Ju, Sanjana Krishnan, Marc'Aurelio Ranzato, Francisco Guzman, Angela Fan

Efficient Continual Learning with Modular Networks and Task-Driven Priors

Dec 23, 2020
Tom Veniat, Ludovic Denoyer, Marc'Aurelio Ranzato
