Understanding Mini batch size

Anbu · July 10, 2021, 10:57am

Hi Sir,

From the lecture mini batch gradient descent we had two doubts can u please help to clarify ?

At 8:05 mt, proff told that mini batch descent does not always exactly converge or oscillate in a very small region. Here in this statement, Does not always converge means should be oscillation around the minimum right sir ? We dont know why proff says does not oscillate also .

Second doubt is, if the algorithm is wandering around the minimum means, can we use small learning rate or reducing learning rate will help to converge to the global minimum ?

Thanks,
Thayanban

paulinpaloalto · July 10, 2021, 2:53pm

With a fixed learning rate, there is never any guarantee that Gradient Descent will converge either in Minibatch or Full Batch. You can always get oscillation or even actual divergence, rather than convergence. If you want better behavior, you have to use a more sophisticated algorithm that adapts the learning rate. Some coverage of this was added in the recent update of the courses. See the Optimization Assignment. Towards the end of that, they show a couple of ways to decrease the learning rate with more iterations.

Topic		Replies	Views
Doubt regarding learning rate decay mechanism Improving Deep Neural Networks: Hyperparameter tun coursera-platform	3	514	January 17, 2023
Mini-batch understanding Improving Deep Neural Networks: Hyperparameter tun coursera-platform	8	693	March 7, 2023
Mini-batch gradient descent decreasing Improving Deep Neural Networks: Hyperparameter tun coursera-platform	7	701	September 3, 2021
Conflict in concept of a video and assignment Improving Deep Neural Networks: Hyperparameter tun coursera-platform	4	596	November 23, 2022
Can the gradient descent converge even if the learning rate set large and fixed Supervised ML: Regression and Classification week-module-1	1	496	August 6, 2022

Understanding Mini batch size

Related topics