Picture for Jiajun Shen

Jiajun Shen

DiPaCo: Distributed Path Composition

Add code
Mar 15, 2024
Figure 1 for DiPaCo: Distributed Path Composition
Figure 2 for DiPaCo: Distributed Path Composition
Figure 3 for DiPaCo: Distributed Path Composition
Figure 4 for DiPaCo: Distributed Path Composition
Viaarxiv icon

Exploring Federated Self-Supervised Learning for General Purpose Audio Understanding

Add code
Feb 05, 2024
Viaarxiv icon

Asynchronous Local-SGD Training for Language Modeling

Add code
Jan 17, 2024
Viaarxiv icon

A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames

Add code
Dec 12, 2023
Viaarxiv icon

DiLoCo: Distributed Low-Communication Training of Language Models

Add code
Nov 14, 2023
Figure 1 for DiLoCo: Distributed Low-Communication Training of Language Models
Figure 2 for DiLoCo: Distributed Low-Communication Training of Language Models
Figure 3 for DiLoCo: Distributed Low-Communication Training of Language Models
Figure 4 for DiLoCo: Distributed Low-Communication Training of Language Models
Viaarxiv icon

Adaptive Uncertainty Estimation via High-Dimensional Testing on Latent Representations

Add code
Oct 25, 2023
Figure 1 for Adaptive Uncertainty Estimation via High-Dimensional Testing on Latent Representations
Figure 2 for Adaptive Uncertainty Estimation via High-Dimensional Testing on Latent Representations
Figure 3 for Adaptive Uncertainty Estimation via High-Dimensional Testing on Latent Representations
Figure 4 for Adaptive Uncertainty Estimation via High-Dimensional Testing on Latent Representations
Viaarxiv icon

L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated Self-Supervised Visual Representation Learning

Add code
Jul 14, 2023
Figure 1 for L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated Self-Supervised Visual Representation Learning
Figure 2 for L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated Self-Supervised Visual Representation Learning
Figure 3 for L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated Self-Supervised Visual Representation Learning
Figure 4 for L-DAWA: Layer-wise Divergence Aware Weight Aggregation in Federated Self-Supervised Visual Representation Learning
Viaarxiv icon

Source-Aware Embedding Training on Heterogeneous Information Networks

Add code
Jul 10, 2023
Figure 1 for Source-Aware Embedding Training on Heterogeneous Information Networks
Figure 2 for Source-Aware Embedding Training on Heterogeneous Information Networks
Figure 3 for Source-Aware Embedding Training on Heterogeneous Information Networks
Figure 4 for Source-Aware Embedding Training on Heterogeneous Information Networks
Viaarxiv icon

Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime

Add code
May 03, 2023
Figure 1 for Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime
Figure 2 for Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime
Figure 3 for Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime
Figure 4 for Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime
Viaarxiv icon

On the Benefits of Leveraging Structural Information in Planning Over the Learned Model

Add code
Mar 15, 2023
Figure 1 for On the Benefits of Leveraging Structural Information in Planning Over the Learned Model
Figure 2 for On the Benefits of Leveraging Structural Information in Planning Over the Learned Model
Figure 3 for On the Benefits of Leveraging Structural Information in Planning Over the Learned Model
Figure 4 for On the Benefits of Leveraging Structural Information in Planning Over the Learned Model
Viaarxiv icon