Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference

Jan 17, 2024
Figures 1-4 for Exploiting Inter-Layer Expert Affinity for Accelerating Mixture-of-Experts Model Inference

View paper on arXiv