Hi, @henrikh.
It’s not the number of mini-batches, but rather the mini-batch number over time: first mini-batch, second mini-batch, etc.
I think it’s written like that to emphasize the fact that it’s taking one gradient descent step per mini-batch. This may be helpful too.
Hope you’re enjoying the course