Picture for Torsten Hoefler

Torsten Hoefler

How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry'' Benchmark

Add code
Dec 21, 2023
Viaarxiv icon

HOT: Higher-Order Dynamic Graph Representation Learning with Efficient Transformers

Add code
Nov 30, 2023
Viaarxiv icon

Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models

Add code
Oct 15, 2023
Figure 1 for Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Figure 2 for Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Figure 3 for Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Figure 4 for Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Viaarxiv icon

Towards End-to-end 4-Bit Inference on Generative Large Language Models

Add code
Oct 13, 2023
Figure 1 for Towards End-to-end 4-Bit Inference on Generative Large Language Models
Figure 2 for Towards End-to-end 4-Bit Inference on Generative Large Language Models
Figure 3 for Towards End-to-end 4-Bit Inference on Generative Large Language Models
Figure 4 for Towards End-to-end 4-Bit Inference on Generative Large Language Models
Viaarxiv icon

VENOM: A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores

Add code
Oct 03, 2023
Figure 1 for VENOM: A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
Figure 2 for VENOM: A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
Figure 3 for VENOM: A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
Figure 4 for VENOM: A Vectorized N:M Format for Unleashing the Power of Sparse Tensor Cores
Viaarxiv icon

Earth Virtualization Engines -- A Technical Perspective

Add code
Sep 16, 2023
Viaarxiv icon

Cached Operator Reordering: A Unified View for Fast GNN Training

Add code
Aug 23, 2023
Viaarxiv icon

Graph of Thoughts: Solving Elaborate Problems with Large Language Models

Add code
Aug 21, 2023
Figure 1 for Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Figure 2 for Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Figure 3 for Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Figure 4 for Graph of Thoughts: Solving Elaborate Problems with Large Language Models
Viaarxiv icon

Differentiable Transportation Pruning

Add code
Jul 31, 2023
Viaarxiv icon

Co-design Hardware and Algorithm for Vector Search

Add code
Jul 06, 2023
Viaarxiv icon