AI Engineering Handbook
3. Post-training
SFT
RL
RLHF
PPO
DPO
optimized DPO
PEFT
prompt tuning
P-tuning
prefix tuning
P-tuning v2
adapter tuning
LoRA
what is LoRA
what is LoRA+
VeRA
LoRA-FA
AdaLoRA
DoRA
X-LoRA