Torsten Hoefler

Towards End-to-end 4-Bit Inference on Generative Large Language Models

Oct 13, 2023
Via arXiv

VENOM: A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores

Oct 03, 2023
Via arXiv

Earth Virtualization Engines -- A Technical Perspective

Sep 16, 2023
Via arXiv

Cached Operator Reordering: A Unified View for Fast GNN Training

Aug 23, 2023
Via arXiv

Graph of Thoughts: Solving Elaborate Problems with Large Language Models

Aug 21, 2023
Via arXiv

Differentiable Transportation Pruning

Jul 31, 2023
Via arXiv

Co-design Hardware and Algorithm for Vector Search

Jul 06, 2023
Via arXiv

SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression

Jun 05, 2023
Via arXiv

ASDL: A Unified Interface for Gradient Preconditioning in PyTorch

May 08, 2023
Via arXiv

STen: Productive and Efficient Sparsity in PyTorch

Apr 15, 2023
Via arXiv