Marc'Aurelio Ranzato

DiPaCo: Distributed Path Composition

Mar 15, 2024
Via arXiv

Asynchronous Local-SGD Training for Language Modeling

Jan 17, 2024
Via arXiv

DiLoCo: Distributed Low-Communication Training of Language Models

Nov 14, 2023
Via arXiv

Towards Robust and Efficient Continual Language Learning

Jul 11, 2023
Via arXiv

Towards Compute-Optimal Transfer Learning

Apr 25, 2023
Via arXiv

NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research

Nov 15, 2022
Via arXiv

Multi-step Planning for Automated Hyperparameter Optimization with OptFormer

Oct 10, 2022
Via arXiv

On Anytime Learning at Macroscale

Jun 17, 2021
Via arXiv

The FLORES-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation

Jun 06, 2021
Via arXiv

Efficient Continual Learning with Modular Networks and Task-Driven Priors

Dec 23, 2020
Via arXiv