Introduction
An Example of Fine-Tuning:
Instruction tuning
RLHF and DPO
Choosing Weight Format and Optimisations
PEFT (Parameter-Efficient Fine-Tuning)
Hybrid Post-Training (HPT)
Abliteration
Conclusion