Deep Learning Specialization, C5, W1

Eshaan_Gupta · April 3, 2024, 3:10am

in Lab-1 where we learn to implement back-propagation in LSTMs from scratch, I doubt that the equation of is wrong.

It should have rather than

so the entire equation should be:
[

](Equation Editor for online mathematics - create, integrate and download))%20%20(1%20-%20tanh%5E2(%5Chat%7Bc%7D%5E%7B%3Ct%3E%7D)#0)*

Please let me know if I am wrong. I can share the derivation as well.

paulinpaloalto · April 3, 2024, 3:41pm

If you look carefully at the diagram of the LSTM cell shown there in the notebook and look at how the various graph segments are labelled, note that my reading is that \tilde{c}^{<t>} is already the output of tanh:

\tilde{c}^{<t>} = tanh(\tilde{pc}^{<t>})

So if my reading is correct, than that last multiplicand there in the overall expression is the correct expression for the derivative of tanh at that point in the graph.

Eshaan_Gupta · April 3, 2024, 4:21pm

It is somewhat confusing how refers to and refers to .

paulinpaloalto · April 3, 2024, 7:07pm

And why dit instead of dut? Yes, there are some slightly odd choices they made there, but notation is always somewhat arbitrary.

Topic		Replies	Views
Week 1 Assignment 1 back-propagation formulas correction Sequence Models coursera-platform	3	670	February 15, 2022
LSTM in pictures week1 - computing a<t> Sequence Models coursera-platform	10	557	January 27, 2022
Backpropagation equation Sequence Models coursera-platform	2	505	March 5, 2023
Is there a typo in back propagation of Course 1 HW3? Neural Networks and Deep Learning coursera-platform	1	501	March 25, 2022
C5_W1_A1 lstm_cell_forward c_next Sequence Models coursera-platform	3	728	July 30, 2021

Deep Learning Specialization, C5, W1

Related topics