I’m unsure exactly what to do on the optional assignment for LSTM backpropagation, `lstm_backward`.
The `da` tensor was given (randomly generated), but `dc` was never given.
I tried initializing `dc_prev` to zero and setting the first `da_prev = da[:, :, T_x]`, then passing them into `lstm_cell_backward`, but that wasn’t correct.
I know I am iterating through the caches correctly, in reverse from `T_x - 1` down to `0`, and I have verified that my `lstm_cell_backward` is coded correctly, so what am I missing?
Reverse-order iteration means you start from the last time-step index.
As far as `dc` is concerned, there is no incoming value at the end when you start the backward propagation. So consider the initial values of `da_prevt` and `dc_prevt` to be the additive identity. Hint: if a + x = x, what is a?
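In code terms, the hint above amounts to starting both running gradients at zero before the reverse loop. A minimal sketch, assuming the assignment’s usual shape conventions (`n_a` hidden units, `m` examples, `T_x` time steps; the example dimensions here are arbitrary):

```python
import numpy as np

n_a, m, T_x = 5, 10, 4          # example dimensions, not the assignment's
da = np.random.randn(n_a, m, T_x)

# The additive identity: a + 0 = a, so both running gradients start at zero.
da_prevt = np.zeros((n_a, m))
dc_prevt = np.zeros((n_a, m))
```

Starting at zero means the first backward step receives `da[:, :, T_x - 1] + 0`, so no special-casing of the last time step is needed.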
I’m slightly confused: if `da_prevt` and `dc_prevt` are initialized to zero and treated initially as the additive identity, then wouldn’t setting `da_prevt` to the initial value of `da`, in this case `da_prev = da[:, :, T_x - 1]`, do the same thing?
I also tried passing `da[:, :, t] + da_prevt`, with `da_prevt = np.zeros((n_a, m))` initially, into the `lstm_cell_backward` function, but that wasn’t correct either.
Did you use the return value, i.e. `gradients`, to update the values of `da_prevt` and `dc_prevt`?
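A sketch of the loop pattern the question above is pointing at, with a hypothetical stub standing in for the assignment’s `lstm_cell_backward` (only the dictionary keys the loop relies on are modeled):

```python
import numpy as np

def lstm_cell_backward_stub(da_next, dc_next, cache):
    # Hypothetical stand-in for the assignment's lstm_cell_backward:
    # it only returns the gradient keys the outer loop needs.
    return {"da_prev": 0.5 * da_next, "dc_prev": 0.5 * dc_next}

n_a, m, T_x = 5, 10, 4
da = np.random.randn(n_a, m, T_x)
caches = [None] * T_x           # placeholders for the per-step caches

da_prevt = np.zeros((n_a, m))
dc_prevt = np.zeros((n_a, m))
for t in reversed(range(T_x)):
    gradients = lstm_cell_backward_stub(da[:, :, t] + da_prevt,
                                        dc_prevt, caches[t])
    # Crucial step: carry the returned gradients into the next iteration.
    da_prevt = gradients["da_prev"]
    dc_prevt = gradients["dc_prev"]
```

Without the last two assignments, every step would see `dc_prevt = 0` and only the current slice of `da`, which matches the symptom described above.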
Yes, I updated `da_prevt` and `dc_prevt` via `=`, not `+=`; the gradients were retrieved via their dictionary keys.
Please click my name and message your notebook as an attachment.
Please fix `lstm_cell_backward`. The calculation of `dc_prev` has a bug.
Thank you, that was the issue. It was hard to spot; I wasn’t expecting it to be the forget gate factor that I was missing, since the unit test given for `lstm_cell_backward` didn’t show a huge change in the value, so I thought it might just be a floating-point rounding error.
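For readers hitting the same bug: both terms of the standard `dc_prev` expression carry the forget gate, which is easy to drop from one of them. A sketch using the assignment’s usual per-cell symbols (`ft` and `ot` are the forget and output gate activations; all arrays here are randomly generated stand-ins):

```python
import numpy as np

n_a, m = 5, 10
rng = np.random.default_rng(0)
ft = 1 / (1 + np.exp(-rng.standard_normal((n_a, m))))  # forget gate (sigmoid)
ot = 1 / (1 + np.exp(-rng.standard_normal((n_a, m))))  # output gate (sigmoid)
c_next = rng.standard_normal((n_a, m))
da_next = rng.standard_normal((n_a, m))
dc_next = rng.standard_normal((n_a, m))

# Both terms carry ft; omitting it from either is the bug discussed above.
dc_prev = dc_next * ft + ot * (1 - np.tanh(c_next) ** 2) * ft * da_next
```

Since `ft` is a sigmoid output near 0.5 for small pre-activations, dropping it roughly halves one term rather than zeroing it, which is why the unit-test discrepancy looked small enough to pass for rounding error.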