Luca Wehrstedt

Sid

Accelerating Transformer Inference and Training with 2:4 Activation Sparsity

Mar 20, 2025
Via arXiv

Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training

Nov 20, 2024
Via arXiv

The Llama 3 Herd of Models

Jul 31, 2024
Via arXiv

PyTorch-BigGraph: A Large-scale Graph Embedding System

Apr 09, 2019
Via arXiv