Gopher

Scaling Language Models: Methods, Analysis & Insights from Training Gopher

RETRO

Improving language models by retrieving from trillions of tokens

WebGPT

Browser-assisted question-answering with human feedback

GLaM

Efficient Scaling of Language Models with Mixture-of-Experts

T0

Multitask Prompted Training Enables Zero-Shot Task Generalization