Picture for Amir Gholami

Amir Gholami

UC Berkeley/LBNL/ICSI

Reliable edge machine learning hardware for scientific applications

Add code
Jun 27, 2024
Figure 1 for Reliable edge machine learning hardware for scientific applications
Figure 2 for Reliable edge machine learning hardware for scientific applications
Figure 3 for Reliable edge machine learning hardware for scientific applications
Figure 4 for Reliable edge machine learning hardware for scientific applications
Viaarxiv icon

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

Add code
Mar 22, 2024
Figure 1 for LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Figure 2 for LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Figure 3 for LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Figure 4 for LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
Viaarxiv icon

AI and Memory Wall

Add code
Mar 21, 2024
Figure 1 for AI and Memory Wall
Figure 2 for AI and Memory Wall
Figure 3 for AI and Memory Wall
Figure 4 for AI and Memory Wall
Viaarxiv icon

KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Add code
Feb 07, 2024
Viaarxiv icon

An LLM Compiler for Parallel Function Calling

Add code
Dec 07, 2023
Viaarxiv icon

SPEED: Speculative Pipelined Execution for Efficient Decoding

Add code
Oct 18, 2023
Viaarxiv icon

SqueezeLLM: Dense-and-Sparse Quantization

Add code
Jun 13, 2023
Figure 1 for SqueezeLLM: Dense-and-Sparse Quantization
Figure 2 for SqueezeLLM: Dense-and-Sparse Quantization
Figure 3 for SqueezeLLM: Dense-and-Sparse Quantization
Figure 4 for SqueezeLLM: Dense-and-Sparse Quantization
Viaarxiv icon

Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior

Add code
Jun 01, 2023
Figure 1 for Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior
Figure 2 for Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior
Figure 3 for Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior
Figure 4 for Towards Foundation Models for Scientific Machine Learning: Characterizing Scaling and Transfer Behavior
Viaarxiv icon

End-to-end codesign of Hessian-aware quantized neural networks for FPGAs and ASICs

Add code
Apr 13, 2023
Viaarxiv icon

Full Stack Optimization of Transformer Inference: a Survey

Add code
Feb 27, 2023
Viaarxiv icon