Picture for Suraiya Tairin

Suraiya Tairin

eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference

Add code
Mar 10, 2025
Figure 1 for eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference
Figure 2 for eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference
Figure 3 for eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference
Figure 4 for eMoE: Task-aware Memory Efficient Mixture-of-Experts-Based (MoE) Model Inference
Viaarxiv icon