Could someone help explain about this to me?

Enfant · May 17, 2023, 7:27am

That I highlight on the pic I don’t understand why suddenly gets dZ like that
why it not be dz[1] = a[1] - y ?

reinoudbosch · May 24, 2023, 9:26pm

Hi Enfant,

dZ[2] is shorthand for dL/dZ[2], with L referring to the loss function. Because of the definition of the loss function, dZ[2] = A[2] - Y. You can find the (non-vectorized) derivation here.

In its turn, dZ[1] = dL/dZ[2] * dZ[2]/dA[1] * dA[1]/dZ[1] (following the chain rule).

dZ[2]/dA[1] = W[2] while dA[1]/dZ[1] = g[1]'(Z[1]). This leads to the equation you highlight.

Topic		Replies	Views
W3_Vectorization of dZ[2] equations Neural Networks and Deep Learning coursera-platform	5	567	March 31, 2023
BackPropagation Derivation Of 2 Layer Neural Network Neural Networks and Deep Learning week-module-3 , coursera-platform	1	253	March 3, 2024
I don't know the difference between dZL = AL - Y and dZL = dAL .* g'(ZL) Neural Networks and Deep Learning coursera-platform	2	825	February 8, 2022
Derivation of dz=da* g'(z) ? or dz= a- y? how is derivation of dz[1] and dz[2] different? Neural Networks and Deep Learning coursera-platform	10	984	June 1, 2023
dZ[1] derivation Neural Networks and Deep Learning coursera-platform	1	730	November 4, 2021

Could someone help explain about this to me?

Related topics