FLAN

Finetuned Language Models Are Zero-Shot Learners

Codex

Evaluating Large Language Models Trained on Code

ELECTRA

Pre-training Text Encoders as Discriminators Rather Than Generators