Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Mar 12, 2024
Sainbayar Sukhbaatar, Olga Golovneva, Vasu Sharma, Hu Xu, Xi Victoria Lin, Baptiste Rozière, Jacob Kahn, Daniel Li, Wen-tau Yih, Jason Weston, Xian Li

[Figures 1-4 for Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM]

View paper on arXiv
