Gradient Descent with Momentum and RMSprop

Hi Sir,

@paulinpaloalto

  1. In the RMSprop video lecture, the professor said that a large learning rate can be used without diverging in the vertical direction. If that statement is true for RMSprop, why wasn't the same thing said in the Gradient Descent with Momentum lecture? Does it not apply to momentum?

  2. From the implementation notes of the exponentially weighted averages (EWA) lecture, we need the code below to compute the average over the last 10 days of temperature:

[screenshot: the EWA pseudocode from the lecture slides]

From the Gradient Descent with Momentum lecture, on the implementation details slide, v_dW with beta = 0.9 computes an average over roughly the last 10 iterations' gradients. If so, why isn't the same code above (the "Repeat { Get next θ_t }" loop) used or shown there? (See the Python sketch after this list.)

  3. In the above pic, what does "On iteration t" mean?
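For question 2, here is my understanding of the EWA pseudocode as a short, runnable Python sketch (the temperature values and variable names are my own example):

temperatures = [13.0, 12.5, 14.0, 15.2, 14.8]   # example data
beta = 0.9                    # averages over roughly 1 / (1 - beta) = 10 days
v = 0.0                       # "V_theta = 0"
for theta_t in temperatures:                 # "Repeat { Get next theta_t ... }"
    v = beta * v + (1 - beta) * theta_t      # EWA update
print(v)                      # exponentially weighted average of the temperatures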

Sorry if I'm asking you a lot of questions, sir. I think you can help me clarify these.

Hi, @Anbu.

I’m sorry we missed this topic :sweat:

  1. Momentum with a large learning rate may overshoot the minimum: it only smooths the gradient direction, whereas RMSprop also divides each step by the RMS of recent gradients, which damps steps in the steep (vertical) direction. You can use this simulator to compare the behaviors of RMSprop and Momentum with different learning rates.
  2. It's the same pseudocode! "On iteration t" is equivalent to "Repeat", and "Compute dW, db" is equivalent to "Get next θ_t".
  3. An iteration is just one gradient descent step. Think of "On iteration t" as a loop (filled in below with the standard momentum updates; compute_gradients is a placeholder for backprop on the current mini-batch):
for t in range(num_minibatches):
    dW, db = compute_gradients(W, b)       # placeholder: backprop on mini-batch t
    v_dW = beta * v_dW + (1 - beta) * dW   # EWA of the gradients ("Get next theta_t")
    v_db = beta * v_db + (1 - beta) * db
    W = W - learning_rate * v_dW           # update with the smoothed gradient
    b = b - learning_rate * v_db
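
For question 1, here is a self-contained toy comparison (my own example: a 2-D quadratic loss that is steep in w2, i.e. the "vertical" direction; the hyperparameter values are mine, not the course's). Because RMSprop divides each step by the RMS of recent gradients, it tolerates a learning rate that would make plain gradient descent diverge along w2:

import numpy as np

# f(w) = 0.5 * (w1**2 + 25 * w2**2): shallow in w1, steep ("vertical") in w2
grad = lambda w: np.array([w[0], 25.0 * w[1]])

w = np.array([5.0, 1.0])
s = np.zeros(2)                       # EWA of squared gradients
beta2, eps, lr = 0.999, 1e-8, 0.1     # lr = 0.1 > 2/25, so plain GD diverges along w2
for t in range(200):
    g = grad(w)
    s = beta2 * s + (1 - beta2) * g**2
    w = w - lr * g / (np.sqrt(s) + eps)   # per-parameter damping of the steep direction
print(w)                              # ends up near the minimum at (0, 0)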

Good luck with the specialization :slight_smile: