# Doubt regarding course 2 Week 2 assignment

In the video explaining momentum, the equation was given as

v_t = \beta v_{t-1} + (1 - \beta) \theta_t

but in the programming exercise, the exponentially weighted momentum for v_t is given as

v_{dW^{[l]}} = \beta v_{dW^{[l]}} + (1 - \beta) dW^{[l]}

Why do the two forms differ?

What the image clarifies is the meaning of exponentially weighted averages, which can be used in many applications such as time-series smoothing and gradient descent with momentum. The photo shows how to apply the concept of exponentially weighted averages to gradient descent with momentum.

In other words, it shows how you can update the weights (W, b) by taking advantage of exponentially weighted averages.
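To make the "many applications" point concrete, here is a minimal sketch of an exponentially weighted average applied to a 1-D time series, using the video's formula v_t = \beta v_{t-1} + (1 - \beta) \theta_t. The function name `ewa` and the sample data are illustrative, not from the course:

```python
def ewa(series, beta=0.9):
    """Exponentially weighted average: v_t = beta * v_{t-1} + (1 - beta) * theta_t."""
    v = 0.0
    averaged = []
    for theta in series:
        v = beta * v + (1 - beta) * theta  # same recurrence as in the video
        averaged.append(v)
    return averaged

temps = [10, 12, 11, 30, 12, 11]  # a noisy series with a spike at index 3
smoothed = ewa(temps)
# the spike at index 3 is heavily damped in the smoothed series
```

The same recurrence smooths a temperature series or a sequence of gradients; only the input changes.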

Thanks,
Abdelrahman

Hey @M_A_Naidu,

In addition to @AbdElRhaman_Fakhry’s explanation, we can also try to map the variables in the 1st screenshot to the variables in the 1st equation of the 2nd screenshot:

1. \theta_t to dW^{[l]}
2. v_{t-1} to v_{dW^{[l]}}
3. v_t to v_{dW^{[l]}}

Now (1) makes sense because the “thing” that we accumulate momentum for is the gradient. (2) is just the gradient’s momentum at the last time step (time step t-1), whereas (3) is the updated momentum at the current time step t.

You see t and t-1 respectively in (3) and (2) in the 1st screenshot but not in the 2nd screenshot, because the 2nd screenshot is oriented to how a program works. In the program, we do not store the momentum value by its time step; instead we keep one variable called v_{dW^{[l]}} which aggregates over time steps.

What’s done in the 1st eq. of the 2nd screenshot is that we take the current value of v_{dW^{[l]}} (which represents the value from the last time step), multiply it by \beta, and add the gradient of the current time step multiplied by 1-\beta. The result replaces the current value of v_{dW^{[l]}}, so that it becomes the momentum value at the current time step.
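The in-place update described above can be sketched in a few lines. The variable names (`v_dW`, `beta`, `learning_rate`) mirror the course notation, but the gradient values and hyperparameters here are made up for illustration:

```python
import numpy as np

beta, learning_rate = 0.9, 0.01
W = np.array([[1.0, -2.0]])
v_dW = np.zeros_like(W)  # one aggregate variable, no per-time-step storage

# pretend these gradients came from two consecutive training iterations
for dW in [np.array([[0.5, 0.5]]), np.array([[0.3, -0.1]])]:
    v_dW = beta * v_dW + (1 - beta) * dW  # overwrite v_dW with the current momentum
    W = W - learning_rate * v_dW          # gradient-descent step using the momentum
```

Because `v_dW` is overwritten each iteration, it always holds the momentum of the current time step, which is exactly why no t or t-1 subscripts appear in the programming exercise.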

Does this make sense to you?

Cheers,
Raymond