Torsten Hoefler

All models are wrong, some are useful: Model Selection with Limited Labels

Oct 17, 2024

Fortify Your Foundations: Practical Privacy and Security for Foundation Model Deployments In The Cloud

Oct 08, 2024

Exploring GPU-to-GPU Communication: Insights into Supercomputer Interconnects

Aug 26, 2024

Hardware Acceleration for Knowledge Graph Processing: Challenges & Recent Developments

Aug 22, 2024

MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models

Aug 21, 2024

Demystifying Higher-Order Graph Neural Networks

Jun 18, 2024

Multi-Head RAG: Solving Multi-Aspect Problems with LLMs

Jun 07, 2024

CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks

Jun 04, 2024

QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs

Mar 30, 2024

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Jan 26, 2024