Lutan Zhao

Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity

May 12, 2025
Via arXiv