Torsten Hoefler

Demystifying Higher-Order Graph Neural Networks

Jun 18, 2024
Via arXiv

Multi-Head RAG: Solving Multi-Aspect Problems with LLMs

Jun 07, 2024
Via arXiv

CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks

Jun 04, 2024
Via arXiv

QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs

Mar 30, 2024
Via arXiv

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Jan 26, 2024
Via arXiv

Topologies of Reasoning: Demystifying Chains, Trees, and Graphs of Thoughts

Jan 25, 2024
Via arXiv

Swing: Short-cutting Rings for Higher Bandwidth Allreduce

Jan 17, 2024
Via arXiv

DiffDA: a diffusion model for weather-scale data assimilation

Jan 11, 2024
Via arXiv

How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry" Benchmark

Dec 21, 2023
Via arXiv

HOT: Higher-Order Dynamic Graph Representation Learning with Efficient Transformers

Nov 30, 2023
Via arXiv