Alert button

SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference

Add code
Bookmark button
Alert button
Jul 05, 2023
Luciano Del Corro, Allie Del Giorno, Sahaj Agarwal, Bin Yu, Ahmed Awadallah, Subhabrata Mukherjee

Figure 1 for SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference
Figure 2 for SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference
Figure 3 for SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference
Figure 4 for SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: