There is no guarantee that the cost will oscillate if you use small mini-batches. It can happen, but itβs not guaranteed to happen. It all depends on the properties of your data and the model you have specified. Of course the values of other hyperparameters like the learning rate are influential here as well.