default search action
"Diversifying the Mixture-of-Experts Representation for Language Models ..."
Boan Liu et al. (2023)
- Boan Liu, Liang Ding, Li Shen, Keqin Peng, Yu Cao, Dazhao Cheng, Dacheng Tao:
Diversifying the Mixture-of-Experts Representation for Language Models with Orthogonal Optimizer. CoRR abs/2310.09762 (2023)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.