Picture for Torsten Hoefler

Torsten Hoefler

Finetuning a Weather Foundation Model with Lightweight Decoders for Unseen Physical Processes

Add code
Jun 23, 2025
Viaarxiv icon

Affordable AI Assistants with Knowledge Graph of Thoughts

Add code
Apr 03, 2025
Viaarxiv icon

Reasoning Language Models: A Blueprint

Add code
Jan 20, 2025
Viaarxiv icon

HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning

Add code
Jan 05, 2025
Figure 1 for HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning
Figure 2 for HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning
Figure 3 for HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning
Figure 4 for HALO: Hadamard-Assisted Lossless Optimization for Efficient Low-Precision LLM Training and Fine-Tuning
Viaarxiv icon

EfQAT: An Efficient Framework for Quantization-Aware Training

Add code
Nov 17, 2024
Figure 1 for EfQAT: An Efficient Framework for Quantization-Aware Training
Figure 2 for EfQAT: An Efficient Framework for Quantization-Aware Training
Figure 3 for EfQAT: An Efficient Framework for Quantization-Aware Training
Figure 4 for EfQAT: An Efficient Framework for Quantization-Aware Training
Viaarxiv icon

All models are wrong, some are useful: Model Selection with Limited Labels

Add code
Oct 17, 2024
Figure 1 for All models are wrong, some are useful: Model Selection with Limited Labels
Figure 2 for All models are wrong, some are useful: Model Selection with Limited Labels
Figure 3 for All models are wrong, some are useful: Model Selection with Limited Labels
Figure 4 for All models are wrong, some are useful: Model Selection with Limited Labels
Viaarxiv icon

Fortify Your Foundations: Practical Privacy and Security for Foundation Model Deployments In The Cloud

Add code
Oct 08, 2024
Figure 1 for Fortify Your Foundations: Practical Privacy and Security for Foundation Model Deployments In The Cloud
Figure 2 for Fortify Your Foundations: Practical Privacy and Security for Foundation Model Deployments In The Cloud
Figure 3 for Fortify Your Foundations: Practical Privacy and Security for Foundation Model Deployments In The Cloud
Figure 4 for Fortify Your Foundations: Practical Privacy and Security for Foundation Model Deployments In The Cloud
Viaarxiv icon

Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects

Add code
Aug 26, 2024
Figure 1 for Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects
Figure 2 for Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects
Figure 3 for Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects
Figure 4 for Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects
Viaarxiv icon

Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments

Add code
Aug 22, 2024
Figure 1 for Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments
Viaarxiv icon

MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models

Add code
Aug 21, 2024
Figure 1 for MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
Figure 2 for MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
Figure 3 for MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
Figure 4 for MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models
Viaarxiv icon