"MPMoE: Memory Efficient MoE for Pre-Trained Models With Adaptive Pipeline ..."
Zheng Zhang et al. (2024)
- Zheng Zhang, Yaqi Xia, Hulin Wang, Donglin Yang, Chuang Hu, Xiaobo Zhou, Dazhao Cheng:
MPMoE: Memory Efficient MoE for Pre-Trained Models With Adaptive Pipeline Parallelism. IEEE Trans. Parallel Distributed Syst. 35(6): 843-856 (2024)