Why do we write:

dZ2 = np.multiply(dA2, np.int64(A2 > 0))

Wasn’t dA2 supposed to be multiplied with the derivative of activation_function(Z2)? What is the significance of np.int64(A2 > 0)?

It is the derivative of the activation function, which is ReLU in this case, right? Think about it for a sec and it should make sense. Sure, they could have written it as *np.int64(Z2 > 0)* and maybe that would have been more obvious, but the result is the same, right?
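A small sketch can make the equivalence concrete. The array values and the placeholder upstream gradient below are made up for illustration; the point is that since *A2 = ReLU(Z2)*, an entry of *A2* is positive exactly when the corresponding entry of *Z2* is, so the two masks match:

```python
import numpy as np

# Hypothetical pre-activation values; A2 = ReLU(Z2).
Z2 = np.array([[-1.5, 0.0, 2.0],
               [ 3.0, -0.5, 0.7]])
A2 = np.maximum(Z2, 0)      # ReLU
dA2 = np.ones_like(Z2)      # placeholder upstream gradient

# ReLU'(z) is 1 where z > 0 and 0 elsewhere. Because ReLU maps
# positive inputs to positive outputs and everything else to 0,
# (A2 > 0) and (Z2 > 0) are the same Boolean mask.
dZ2_from_A = np.multiply(dA2, np.int64(A2 > 0))
dZ2_from_Z = np.multiply(dA2, np.int64(Z2 > 0))
assert np.array_equal(dZ2_from_A, dZ2_from_Z)
```

Note the one edge case, *Z2 == 0*: ReLU outputs 0 there, so both masks agree on 0 as well.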

Thank you, that cleared it up. One more thing: in *np.int64(A2 > 0)*, why do we write *A2 > 0*? Shouldn’t it just be *np.int64(A2)*?

Sorry, that wouldn’t work. What is the derivative of ReLU? It’s 0 for inputs <= 0 and 1 for inputs > 0, right? That’s exactly what the expression *A2 > 0* or *Z2 > 0* gives you, just with a Boolean datatype, which you then convert to a numeric one. Actually, I’d think it would make more sense to convert it to a float rather than an integer, but NumPy’s type coercion rules make the integer mask work just as well.
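To illustrate the Boolean-to-numeric step with a made-up example: the comparison produces a Boolean mask, and converting it to int or float gives the same 0/1 pattern, so multiplying by either zeroes out the gradient wherever ReLU was flat:

```python
import numpy as np

Z2 = np.array([-2.0, 0.0, 3.5])
mask_bool = Z2 > 0                    # Boolean mask: False, False, True
mask_int = np.int64(Z2 > 0)          # same mask as 0/1 integers
mask_float = (Z2 > 0).astype(float)  # same mask as 0.0/1.0 floats

dA2 = np.array([0.1, 0.2, 0.3])      # placeholder upstream gradient
# Either numeric mask kills the gradient where the input was <= 0.
assert np.allclose(dA2 * mask_int, dA2 * mask_float)
```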

*ReLU(4.3) == 4.3*, right? So *np.int64* of that is 4, not 1 — you’d get a truncated copy of the activations, not the derivative mask.
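A quick sketch of the difference, using made-up activation values: *np.int64(A2)* merely truncates each value toward zero, while *np.int64(A2 > 0)* gives the 0/1 derivative mask:

```python
import numpy as np

A2 = np.array([4.3, 0.0, 2.9])   # hypothetical ReLU outputs
truncated = np.int64(A2)         # truncates the values: 4, 0, 2
mask = np.int64(A2 > 0)          # derivative mask: 1, 0, 1
```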