"Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A ..."

Shaden Smith et al. (2022)

DOI:

access: open

type: Informal or Other Publication

metadata version: 2022-02-02