Mahsa Salmani

LLM Inference Acceleration via Efficient Operation Fusion

Feb 24, 2025
Via arXiv

SLaNC: Static LayerNorm Calibration

Oct 14, 2024
Via arXiv

Beyond the Limits: A Survey of Techniques to Extend the Context Length in Large Language Models

Feb 03, 2024
Via arXiv