Scaling Language Models: Methods, Analysis & Insights from Training Gopher
Improving language models by retrieving from trillions of tokens
Browser-assisted question-answering with human feedback
Efficient Scaling of Language Models with Mixture-of-Experts
Multitask Prompted Training Enables Zero-Shot Task Generalization