default search action
"Large Models are Parsimonious Learners: Activation Sparsity in Trained ..."
Zonglin Li et al. (2022)
- Zonglin Li, Chong You, Srinadh Bhojanapalli, Daliang Li, Ankit Singh Rawat, Sashank J. Reddi, Ke Ye, Felix Chern, Felix X. Yu, Ruiqi Guo, Sanjiv Kumar:
Large Models are Parsimonious Learners: Activation Sparsity in Trained Transformers. CoRR abs/2210.06313 (2022)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.