Mini-batch gradient descent decreasing

Hi, @henrikh.

It’s not the number of mini-batches, but rather the index of the current mini-batch as training progresses: first mini-batch, second mini-batch, and so on.

I think it’s written like that to emphasize that one gradient descent step is taken per mini-batch. This may be helpful too.
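
In case a sketch makes it more concrete, here is a minimal example of what I mean. It is not the course's code: the toy data, the learning rate, and the function name `minibatch_gd` are all made up for illustration. It fits a single weight and records the cost once per mini-batch, so the position in `costs` plays the role of the mini-batch number t.

```python
import random


def minibatch_gd(xs, ys, batch_size=4, epochs=3, lr=0.05, seed=0):
    """Fit y ~ w * x with mini-batch gradient descent (illustrative only).

    Returns the final w and the cost recorded at each mini-batch step.
    """
    rng = random.Random(seed)
    w = 0.0
    costs = []  # costs[t - 1] is the cost measured on mini-batch number t
    indices = list(range(len(xs)))
    for _ in range(epochs):
        rng.shuffle(indices)  # new mini-batch split every epoch
        for start in range(0, len(indices), batch_size):
            batch = indices[start:start + batch_size]
            # Cost and gradient are computed on this mini-batch only.
            errors = [w * xs[i] - ys[i] for i in batch]
            cost = sum(e * e for e in errors) / (2 * len(batch))
            grad = sum(e * xs[i] for e, i in zip(errors, batch)) / len(batch)
            costs.append(cost)
            w -= lr * grad  # exactly one gradient descent step per mini-batch
    return w, costs


xs = [i / 10 for i in range(1, 17)]  # 16 toy inputs (assumed data)
ys = [3.0 * x for x in xs]           # noiseless targets, true w = 3
w, costs = minibatch_gd(xs, ys)
```

With 16 examples and a batch size of 4 there are 4 mini-batches per epoch, so 3 epochs give 12 entries in `costs`. Plotting them against t = 1, 2, ..., 12 gives the kind of x-axis I'm describing. Since each cost is measured on a different mini-batch, the curve doesn't have to go down at every single step.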

Hope you’re enjoying the course 🙂