Joe Chau

FP8-LM: Training FP8 Large Language Models

Oct 27, 2023
Via arXiv

Tutel: Adaptive Mixture-of-Experts at Scale

Jun 07, 2022
Via arXiv