Paper Review
QLoRA
Efficient Finetuning of Quantized LLMs
Paper Review
Mamba
Linear-Time Sequence Modeling with Selective State Spaces
Paper Review
Mistral
Mistral 7B
Paper Review
LLM.int8()
8-bit Matrix Multiplication for Transformers at Scale
Paper Review
LLaMA 2
Open Foundation and Fine-Tuned Chat Models
1
2
3
4
…
17