Alert button

TCRA-LLM: Token Compression Retrieval Augmented Large Language Model for Inference Cost Reduction

Oct 25, 2023
Junyi Liu, Liangzhi Li, Tong Xiang, Bowen Wang, Yiming Qian

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: