I'm having trouble with the programming assignment "Planar_data_classification_with_one_hidden_layer", Exercise 7 - update_parameters. I need to implement the function for the equation:
\theta = \theta - \alpha \displaystyle \frac{\partial J}{\partial \theta}
but I can't figure out how to compute the values for \partial J and \partial \theta. I tried scanning back over the lecture notes and playing the videos, but I can't locate where this equation was explained. It would be nice if the instructions in the assignment included some kind of reminder.
What is the equation for these, or in which lecture was this covered? Thanks.
It was all covered in this lecture and this lecture. The formulas are also given in the notebook. You compute the gradients in the backward_propagation routine and then they are just passed to update_parameters in the grads dictionary. Please see the section titled "Exercise 6 - backward_propagation" in the Week 3 assignment.
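For reference, the hand-off between the two functions looks roughly like this. This is only a minimal sketch: the key names and shapes are what I recall from the two-layer notebook (n_x = 2, n_h = 4, n_y = 1), so treat them as assumptions and check against your own notebook.

```python
import numpy as np

# Minimal sketch of the hand-off (key names and shapes assumed from the
# two-layer notebook; the zero arrays are just placeholders).
grads = {"dW1": np.zeros((4, 2)),   # dJ/dW1, same shape as W1
         "db1": np.zeros((4, 1)),   # dJ/db1, same shape as b1
         "dW2": np.zeros((1, 4)),   # dJ/dW2, same shape as W2
         "db2": np.zeros((1, 1))}   # dJ/db2, same shape as b2

# update_parameters never recomputes anything: it just looks the gradients up.
dW1 = grads["dW1"]
db1 = grads["db1"]
```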
Note that Prof Ng does not use the notation \theta for the parameters in this course, and he also tries to avoid the standard mathematical notation for partial derivatives. Instead, he uses his own shorthand for the gradient values:
dW^{[l]} = \displaystyle \frac {\partial J}{\partial W^{[l]}}
db^{[l]} = \displaystyle \frac {\partial J}{\partial b^{[l]}}
So using his notation, the formula for updating, say, W^{[l]} is:
W^{[l]} = W^{[l]} - \alpha * dW^{[l]}
With that in mind, please read the back propagation section of the notebook again and it should all make sense.
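If it still isn't clicking, here is a minimal sketch of what Exercise 7 is asking for, assuming the parameters/grads key names from the two-layer notebook (W1, b1, W2, b2 and dW1, db1, dW2, db2). It is not the official solution, just the formula above applied to each parameter, with learning_rate playing the role of \alpha (the default value here is illustrative):

```python
import copy

def update_parameters(parameters, grads, learning_rate=1.2):
    """One step of gradient descent: W := W - alpha * dW, b := b - alpha * db."""
    # Work on a copy so the caller's dictionary is not modified in place.
    parameters = copy.deepcopy(parameters)

    parameters["W1"] = parameters["W1"] - learning_rate * grads["dW1"]
    parameters["b1"] = parameters["b1"] - learning_rate * grads["db1"]
    parameters["W2"] = parameters["W2"] - learning_rate * grads["dW2"]
    parameters["b2"] = parameters["b2"] - learning_rate * grads["db2"]

    return parameters
```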
Note that he does not derive these formulas, though: he simply presents them. These courses are specifically designed not to require knowledge of even univariate calculus, let alone matrix calculus. If you have the math background, here's a thread with pointers to derivations available on the web.