Week 3 update_parameters, how to compute partial derivative J

paulinpaloalto · July 5, 2021, 5:24am

It was all covered in this lecture and this lecture. The formulas are also given in the notebook. You compute the gradients in the backward_propagation routine and then they are just passed to update_parameters in the grads dictionary. Please see the section titled:

Exercise 6 - backward_propagation

in the Week 3 assignment. Note that Prof Ng does not use the notation \theta for the parameters in this course. He also tries to avoid the mathematical notation for partial derivatives. Instead, he uses his own shorthand for the gradient values:

dW^{[l]} = \displaystyle \frac {\partial J}{\partial W^{[l]}}
db^{[l]} = \displaystyle \frac {\partial J}{\partial b^{[l]}}
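To make the shorthand concrete, here is a minimal sketch that checks "dw means \partial J / \partial w" numerically. It is not taken from the notebook: the one-example squared-error cost and the function names `cost`, `analytic_grads` and `numeric_grads` are assumptions chosen only for illustration.

```python
# Hypothetical toy example (not from the assignment): one training example,
# a linear prediction w*x + b, and a squared-error cost J.

def cost(w, b, x, y):
    """J(w, b) = 0.5 * (w*x + b - y)^2 for a single example."""
    return 0.5 * (w * x + b - y) ** 2


def analytic_grads(w, b, x, y):
    """Return (dw, db), i.e. dJ/dw and dJ/db, obtained with the chain rule."""
    error = w * x + b - y
    return error * x, error


def numeric_grads(w, b, x, y, eps=1e-6):
    """Estimate the same partial derivatives with central differences."""
    dw = (cost(w + eps, b, x, y) - cost(w - eps, b, x, y)) / (2 * eps)
    db = (cost(w, b + eps, x, y) - cost(w, b - eps, x, y)) / (2 * eps)
    return dw, db


w, b, x, y = 0.5, -0.2, 3.0, 1.0
dw, db = analytic_grads(w, b, x, y)
dw_num, db_num = numeric_grads(w, b, x, y)

# Both comparisons should hold: the shorthand values are the partial derivatives.
print(abs(dw - dw_num) < 1e-6, abs(db - db_num) < 1e-6)
```

The same idea scales up: the values that backward_propagation returns are the analytic gradients, and a finite-difference check like this is one way to convince yourself they really are the partial derivatives of J.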

So using his notation, the formula for updating, say, W^{[l]} is:

W^{[l]} = W^{[l]} - \alpha * dW^{[l]}
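As a rough sketch of how that update rule might look in code, assuming the gradients arrive in a `grads` dictionary whose keys are the parameter names prefixed with `d` (the key names `W1`, `b1`, `dW1`, ... and the function body here are illustrative assumptions, not the graded solution):

```python
def update_parameters(parameters, grads, learning_rate):
    """Apply one gradient descent step, W = W - alpha * dW, to every parameter.

    Assumes (hypothetically) that grads["d" + name] holds the gradient for
    parameters[name]. Only `-` and `*` are used, so the values can be plain
    floats or NumPy arrays of matching shapes.
    """
    updated = {}
    for name, value in parameters.items():
        updated[name] = value - learning_rate * grads["d" + name]
    return updated


# Scalars stand in for the weight matrices to keep the example dependency-free.
parameters = {"W1": 0.5, "b1": 0.0, "W2": -0.3, "b2": 0.1}
grads = {"dW1": 0.2, "db1": -0.1, "dW2": 0.05, "db2": 0.0}

new_parameters = update_parameters(parameters, grads, learning_rate=0.1)
print(new_parameters)
```

Note that nothing is differentiated inside this function: the derivatives were already computed in backward_propagation, so the update step is just arithmetic on the values it is handed.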

With that in mind, please read the back propagation section of the notebook again and it should all make sense.

Note that he does not derive these formulas, though: he simply presents them. These courses are specifically designed not to require knowledge of even univariate calculus, let alone matrix calculus. If you have the math background, here’s a thread with pointers to derivations available on the web.
