Hi there,
I am trying to work through the ungraded exercise 3.1. I can run the code, but the result is wrong.
I believe my mistake must be in the initialization of da_prevt. What I am doing now is:
assigning da_prevt = da[:, :, T_x-1] to pick the last time step,
and then, in the reverse loop over t, computing gradients by calling rnn_cell_backward with da_prevt and caches[t].
May I have some guidance on what I am doing wrong? Many thanks!
When we compute the gradients for a step in rnn_backward, they depend on da, which holds the upstream gradients (from the loss backpropagated to a at each time step), and on da_prevt, which is the gradient flowing back from the subsequent RNN cell. Note that da and da_prevt are different.
Hence, when you call rnn_cell_backward, you should include both the da of the current step and the da_prevt from the previous backward step.
If you still get stuck, my additional hint is to pass da[:, :, t] + da_prevt to rnn_cell_backward. Note that da_prevt for the first backward step is all zeros, because that step corresponds to the last cell.
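For concreteness, here is a minimal sketch of that loop body. It assumes the notebook's rnn_cell_backward and the caches returned by rnn_forward, with n_a, m, T_x, and da already defined as in the assignment:

```python
import numpy as np

# da_prevt starts at zero: there is no cell after the last one.
da_prevt = np.zeros((n_a, m))

for t in reversed(range(T_x)):
    # Combine the loss gradient at step t with the gradient flowing
    # back from step t + 1, then backprop through a single cell.
    gradients = rnn_cell_backward(da[:, :, t] + da_prevt, caches[t])
    da_prevt = gradients["da_prev"]   # carry to the previous time step
```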
Hope this helps.
Many thanks for the reply. Sorry I didn't explain my issue clearly.
I have passed the rnn_cell_backward part and its result matches the expected output. I am actually stuck on the rnn_backward part.
I can run the code, but the result is different from the expected output.
You should re-read the answer by @kienmn, because he basically spelled it out. But, again, to summarize: initialize da_prevt = np.zeros((n_a, m)), like every other gradient. Then, within the loop, call rnn_cell_backward with da[:, :, t] + da_prevt.
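For reference, here is a sketch of how the whole function could be wired together. It assumes the notebook's rnn_cell_backward, the caches tuple returned by rnn_forward, and the gradient dictionary keys used in the assignment; treat the exact layout as an assumption:

```python
import numpy as np

def rnn_backward(da, caches):
    # Unpack the caches produced by rnn_forward (layout as in the notebook).
    (caches, x) = caches
    (a1, a0, x1, parameters) = caches[0]

    # Retrieve dimensions from da and the first input.
    n_a, m, T_x = da.shape
    n_x, m = x1.shape

    # Initialize every gradient with zeros, da_prevt included.
    dx = np.zeros((n_x, m, T_x))
    dWax = np.zeros((n_a, n_x))
    dWaa = np.zeros((n_a, n_a))
    dba = np.zeros((n_a, 1))
    da_prevt = np.zeros((n_a, m))

    for t in reversed(range(T_x)):
        # Gradient at step t = upstream gradient + gradient from step t + 1.
        gradients = rnn_cell_backward(da[:, :, t] + da_prevt, caches[t])
        da_prevt = gradients["da_prev"]
        dx[:, :, t] = gradients["dxt"]
        # Parameter gradients accumulate over all time steps.
        dWax += gradients["dWax"]
        dWaa += gradients["dWaa"]
        dba += gradients["dba"]

    # Whatever flows out of the first cell is the gradient w.r.t. a0.
    da0 = da_prevt

    return {"dx": dx, "da0": da0, "dWax": dWax, "dWaa": dWaa, "dba": dba}
```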