"S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training."

Yuezhou Hu, Jun Zhu, Jianfei Chen (2024)

Details and statistics

DOI: 10.48550/ARXIV.2409.09099

access: open

type: Informal or Other Publication

metadata version: 2024-10-12