Alert button

Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer

Oct 15, 2023
Boan Liu, Liang Ding, Li Shen, Keqin Peng, Yu Cao, Dazhao Cheng, Dacheng Tao

Share this with someone who'll enjoy it:

View paper onarxiv icon

Share this with someone who'll enjoy it: