OPT: Open Pre-trained Transformer Language Models
Training Compute-Optimal Large Language Models
PaLM: Scaling Language Modeling with Pathways
Training language models to follow instructions with human feedback
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model