4. Common Models

Overview

This chapter will summarize widely used LLM architectures and families (GPT, LLaMA, T5, Mistral, etc.), with notes on design choices and trade-offs.

BERT

pLAM

GPT

LLaMA

glm

qwen

deepseek

Models