Arthur Szlam

Context Training with Active Information Seeking

May 14, 2026

Decoupled DiLoCo for Resilient Distributed Pre-training

Apr 23, 2026

Communication-Efficient Language Model Training Scales Reliably and Robustly: Scaling Laws for DiLoCo

Mar 12, 2025

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Jan 30, 2025

Deliberation in Latent Space via Differentiable Cache Augmentation

Dec 23, 2024

DiPaCo: Distributed Path Composition

Mar 15, 2024

Asynchronous Local-SGD Training for Language Modeling

Jan 17, 2024

DiLoCo: Distributed Low-Communication Training of Language Models

Nov 14, 2023

A Data Source for Reasoning Embodied Agents

Sep 14, 2023

Transforming Human-Centered AI Collaboration: Redefining Embodied Agents Capabilities through Interactive Grounded Language Instructions

May 18, 2023