Alert button

Hydragen: High-Throughput LLM Inference with Shared Prefixes

Feb 07, 2024
Jordan Juravsky, Bradley Brown, Ryan Ehrlich, Daniel Y. Fu, Christopher Ré, Azalia Mirhoseini

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: