RETRO

Improving language models by retrieving from trillions of tokens

WebGPT

Browser-assisted question-answering with human feedback

GLaM

Efficient Scaling of Language Models with Mixture-of-Experts

T0

Multitask Prompted Training Enables Zero-Shot Task Generalization

FLAN

Finetuned Language Models Are Zero-Shot Learners