Guang Yan

Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity

May 12, 2025
Via arXiv