Alert button
Picture for Marc'Aurelio Ranzato

Marc'Aurelio Ranzato

Alert button

Asynchronous Local-SGD Training for Language Modeling

Jan 17, 2024
Bo Liu, Rachita Chhaparia, Arthur Douillard, Satyen Kale, Andrei A. Rusu, Jiajun Shen, Arthur Szlam, Marc'Aurelio Ranzato

Viaarxiv icon

DiLoCo: Distributed Low-Communication Training of Language Models

Nov 14, 2023
Arthur Douillard, Qixuan Feng, Andrei A. Rusu, Rachita Chhaparia, Yani Donchev, Adhiguna Kuncoro, Marc'Aurelio Ranzato, Arthur Szlam, Jiajun Shen

Viaarxiv icon

Towards Robust and Efficient Continual Language Learning

Jul 11, 2023
Adam Fisch, Amal Rannen-Triki, Razvan Pascanu, Jörg Bornschein, Angeliki Lazaridou, Elena Gribovskaya, Marc'Aurelio Ranzato

Figure 1 for Towards Robust and Efficient Continual Language Learning
Figure 2 for Towards Robust and Efficient Continual Language Learning
Figure 3 for Towards Robust and Efficient Continual Language Learning
Figure 4 for Towards Robust and Efficient Continual Language Learning
Viaarxiv icon

Towards Compute-Optimal Transfer Learning

Apr 25, 2023
Massimo Caccia, Alexandre Galashov, Arthur Douillard, Amal Rannen-Triki, Dushyant Rao, Michela Paganini, Laurent Charlin, Marc'Aurelio Ranzato, Razvan Pascanu

Figure 1 for Towards Compute-Optimal Transfer Learning
Figure 2 for Towards Compute-Optimal Transfer Learning
Figure 3 for Towards Compute-Optimal Transfer Learning
Figure 4 for Towards Compute-Optimal Transfer Learning
Viaarxiv icon

NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research

Nov 15, 2022
Jorg Bornschein, Alexandre Galashov, Ross Hemsley, Amal Rannen-Triki, Yutian Chen, Arslan Chaudhry, Xu Owen He, Arthur Douillard, Massimo Caccia, Qixuang Feng, Jiajun Shen, Sylvestre-Alvise Rebuffi, Kitty Stacpoole, Diego de las Casas, Will Hawkins, Angeliki Lazaridou, Yee Whye Teh, Andrei A. Rusu, Razvan Pascanu, Marc'Aurelio Ranzato

Figure 1 for NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
Figure 2 for NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
Figure 3 for NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
Figure 4 for NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research
Viaarxiv icon

Multi-step Planning for Automated Hyperparameter Optimization with OptFormer

Oct 10, 2022
Lucio M. Dery, Abram L. Friesen, Nando De Freitas, Marc'Aurelio Ranzato, Yutian Chen

Figure 1 for Multi-step Planning for Automated Hyperparameter Optimization with OptFormer
Figure 2 for Multi-step Planning for Automated Hyperparameter Optimization with OptFormer
Figure 3 for Multi-step Planning for Automated Hyperparameter Optimization with OptFormer
Figure 4 for Multi-step Planning for Automated Hyperparameter Optimization with OptFormer
Viaarxiv icon

On Anytime Learning at Macroscale

Jun 17, 2021
Lucas Caccia, Jing Xu, Myle Ott, Marc'Aurelio Ranzato, Ludovic Denoyer

Figure 1 for On Anytime Learning at Macroscale
Figure 2 for On Anytime Learning at Macroscale
Figure 3 for On Anytime Learning at Macroscale
Figure 4 for On Anytime Learning at Macroscale
Viaarxiv icon

The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

Jun 06, 2021
Naman Goyal, Cynthia Gao, Vishrav Chaudhary, Peng-Jen Chen, Guillaume Wenzek, Da Ju, Sanjana Krishnan, Marc'Aurelio Ranzato, Francisco Guzman, Angela Fan

Figure 1 for The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation
Figure 2 for The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation
Figure 3 for The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation
Figure 4 for The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation
Viaarxiv icon

Efficient Continual Learning with Modular Networks and Task-Driven Priors

Dec 23, 2020
Tom Veniat, Ludovic Denoyer, Marc'Aurelio Ranzato

Figure 1 for Efficient Continual Learning with Modular Networks and Task-Driven Priors
Figure 2 for Efficient Continual Learning with Modular Networks and Task-Driven Priors
Figure 3 for Efficient Continual Learning with Modular Networks and Task-Driven Priors
Figure 4 for Efficient Continual Learning with Modular Networks and Task-Driven Priors
Viaarxiv icon

Few-shot Sequence Learning with Transformers

Dec 17, 2020
Lajanugen Logeswaran, Ann Lee, Myle Ott, Honglak Lee, Marc'Aurelio Ranzato, Arthur Szlam

Figure 1 for Few-shot Sequence Learning with Transformers
Figure 2 for Few-shot Sequence Learning with Transformers
Figure 3 for Few-shot Sequence Learning with Transformers
Figure 4 for Few-shot Sequence Learning with Transformers
Viaarxiv icon