Course 2, week 2, update_parameters_with_momentum() issue

francisminhan · November 24, 2021, 12:11am

I got my code pass the checker but don’t entire understand the math behind this.

v[“dW” + str(l)] = beta * v[“dW” + str(l)] + (1 - beta) * grads[“dW” + str(l)] (1)
which is supposedly the correct implementation is the same as:
v[“dW” + str(l)] = (1 - beta) * grads[“dW” + str(l)] (2)
since v[“dW” + str(l)] is initialized = 0.

I tried (2) and pass all test.

Should it be v[“dW” + str (l-1) ] for l>1 and just 0 for l=1, as we take ‘beta’ part of the LAST momentum and give it a bit more acceleration?

Am I understanding this correctly?

paulinpaloalto · November 24, 2021, 1:15am

Your formulas 1) and 2) are not equivalent. Note that v[“dW1”] is not the same thing as grads[“dW1”]. Also note that these methods are iterative, right? So the fact that the velocity is initialized to zero is not relevant after the first iteration.

francisminhan · November 24, 2021, 3:37am

For a second I confused an iteration with a layer. I realized that immediately after moving on to the next function in the exercise. This is really helpful. Thank you for fast response, sir.

Topic		Replies	Views
C2, W2, a question about update_parameters_with_momentum Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	502	September 3, 2022
Momentum Formula Improving Deep Neural Networks: Hyperparameter tun week-module-2 , coursera-platform	5	212	May 18, 2024
Course 2, Week 2, Assignment Optimization_methods Improving Deep Neural Networks: Hyperparameter tun coursera-platform	8	447	October 12, 2023
C2, W2, Programming Task / Update_parameters_with_momentum Improving Deep Neural Networks: Hyperparameter tun coursera-platform	5	632	September 27, 2021
Error in Programming Assignment: Optimization Methods Improving Deep Neural Networks: Hyperparameter tun week-module-2 , coursera-platform	3	178	May 19, 2024

Course 2, week 2, update_parameters_with_momentum() issue

Related topics