After running mini-batch gradient descent, the cost is supposed to zigzag, as shown in the lecture. However, this is what I got:
Is this because the batch size is relatively small, so the zigzag is not very pronounced and the curve just looks like a smooth line?
There is no guarantee that the cost will oscillate when you use small mini-batches. It can happen, but it is not guaranteed to happen. It all depends on the properties of your data and the model you have specified. Of course, the values of other hyperparameters, such as the learning rate, are influential here as well.
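To see this for yourself, here is a minimal sketch (not from the course assignment, all names are illustrative): mini-batch gradient descent on a simple synthetic linear-regression problem, where you can vary the batch size and learning rate and watch how much the per-iteration cost actually fluctuates.

```python
import numpy as np

# Illustrative sketch: mini-batch gradient descent on synthetic
# linear-regression data. Depending on the data, the batch size, and
# the learning rate, the cost curve can look noisy or nearly smooth.
rng = np.random.default_rng(0)
X = rng.normal(size=(1000, 1))
y = 3.0 * X[:, 0] + rng.normal(scale=0.1, size=1000)

def run_minibatch_gd(batch_size, lr=0.05, epochs=5):
    w, b = 0.0, 0.0
    costs = []
    n = len(y)
    for _ in range(epochs):
        perm = rng.permutation(n)  # reshuffle each epoch
        for start in range(0, n, batch_size):
            idx = perm[start:start + batch_size]
            xb, yb = X[idx, 0], y[idx]
            err = w * xb + b - yb
            costs.append(np.mean(err ** 2) / 2)  # cost on current mini-batch
            w -= lr * np.mean(err * xb)          # gradient step for w
            b -= lr * np.mean(err)               # gradient step for b
    return np.array(costs)

costs_small = run_minibatch_gd(batch_size=16)
costs_large = run_minibatch_gd(batch_size=256)

# Both runs trend downward overall; the small-batch curve usually shows
# more iteration-to-iteration variation, but how visible the zigzag is
# depends on the noise in the data relative to the signal.
print("small-batch cost:", costs_small[0], "->", costs_small[-1])
print("large-batch cost:", costs_large[0], "->", costs_large[-1])
```

Plotting `costs_small` and `costs_large` side by side makes the point concrete: with low-noise data like this, even a batch size of 16 can produce a curve that looks quite smooth when zoomed out.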