Picture for Guang Yan

Guang Yan

Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity

Add code
May 12, 2025
Figure 1 for Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Figure 2 for Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Figure 3 for Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Figure 4 for Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Viaarxiv icon