I am currently going through the “Finetuning Large Language Models” lecture by Sharon Zhou. In lesson three, she discusses steps and batch size, but I’m a bit confused. From my understanding in deep learning and image classification, if there are 100 samples and a batch size of 10, then for each epoch, there would be 10 batches running through the model, which equates to 10 steps per epoch.
Am I understanding this correctly, or am I missing something? Can someone please educate me on this?