Torsten Hoefler

PipeFisher: Efficient Training of Large Language Models Using Pipelining and Fisher Information Matrices

Nov 25, 2022
Via arXiv

Spatial Mixture-of-Experts

Nov 24, 2022
Via arXiv

GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers

Oct 31, 2022
Via arXiv

Compressing multidimensional weather and climate data into neural networks

Oct 22, 2022
Via arXiv

Neural Graph Databases

Sep 20, 2022
Via arXiv

Efficient Quantized Sparse Matrix Operations on Tensor Cores

Sep 14, 2022
Via arXiv

HammingMesh: A Network Topology for Large-Scale Deep Learning

Sep 03, 2022
Via arXiv

ENS-10: A Dataset For Post-Processing Ensemble Weather Forecast

Jun 29, 2022
Via arXiv

Parallel and Distributed Graph Neural Networks: An In-Depth Concurrency Analysis

May 30, 2022
Via arXiv

Near-Optimal Sparse Allreduce for Distributed Deep Learning

Jan 19, 2022
Via arXiv