Introduction
An Example of Fine-Tuning:
Instruction tuning
RLHF and DPO
Choosing Weight Format and Optimisations
PEFT (Parameter-Efficient Fine-Tuning)
Abliteration
Conclusion