Optional Lab: Back propagation using a computing graph

VeronikaS · February 21, 2023, 8:59am

Hello!

Why do we need to compute all the derivatives of the nodes from right to left if we are able to compute ∂𝐽/∂𝑤 arithmetically at the beginning of the graph ?

J_epsilon = ((w+0.001)*x+b - y)**2/2
k = (J_epsilon - J)/0.001

rmwkwok · February 21, 2023, 12:24pm

Hi @VeronikaS,

It is because we can reuse the result from layers on the right in the layers on the left. It is more apparent if we look at this slide:

See the green arrows in the bottom part of the slide?

If we can reuse something, we save some computation time.

Cheers,
Raymond

VeronikaS · February 21, 2023, 12:57pm

Thank you, @rmwkwok.

One more question about this lab:

do we use the same epsilon for every node of the graph during the back prop?

rmwkwok · February 21, 2023, 1:14pm

@VeronikaS, if you are asking about actual backprop when training a model, we don’t really use epsilon because it can introduce rounding errors. We use epsilon in the optional lab just for demonstration purpose and to provide a way to compute derivatives without the need to learn differentiation.

Cheers,
Raymond

Topic		Replies	Views
Didn't understand how gradient computation using back prop is order of N+P Advanced Learning Algorithms week-module-2	4	262	February 26, 2024
Computation graph: N + P vs N x P Advanced Learning Algorithms week-module-2	16	698	April 1, 2024
Reason for using BackProp for calculating derivative Advanced Learning Algorithms week-module-2	3	384	December 2, 2023
One step of backward propagation on a computation graph yields derivative of final output variable Neural Networks and Deep Learning coursera-platform	1	434	July 20, 2023
Computing derivatives: why not analyticaly? Advanced Learning Algorithms week-module-2	4	464	May 26, 2023

Optional Lab: Back propagation using a computing graph

Related topics