Toward Inference-optimal Mixture-of-Expert Large Language Models

Apr 03, 2024
Longfei Yun, Yonghao Zhuang, Yao Fu, Eric P Xing, Hao Zhang
