Once you know the theory of machine learning, transformers, and all the extras e.g. LoRA and QLoRA, and you know how to use the Hugging Face interface, you essentially know what you need to start finetuning, but that doesn’t mean you’re any good at it.
There’s so many parameters and having studied the theory, you know what each one means theoretically, but practically you don’t have enough experience to be able to make insights like “hmm yeah it seems the learning rate is too high”. Aside from just practice how do you gain these intuitions?
As an example, I have a notebook here and I feel like I set up everything right, but as you can see, the evaluation before and after is unchanged.