I’m doing the optional backpropagation part of the RNN programming assignment 1 and I spent an embarrassing amount of time trying to figure out why my
`def rnn_backward(da, caches):`
function was returning the wrong answer. There is a longer 2021 thread on this in Week 1 Assignment 1 Backpropagation, but like others in that thread, I overlooked adding the cost derivatives from the output/fully connected layer at each time step in my implementation.
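For anyone else stuck on this, here is a minimal sketch of the loop, assuming the notebook's `rnn_cell_backward(da_next, cache)` helper is in scope and returns the usual gradient keys (names may differ slightly in your notebook version). The key point is that the gradient passed into each cell is `da[:, :, t] + da_prevt`, not `da[:, :, t]` alone:

```python
import numpy as np

def rnn_backward(da, caches):
    # Sketch only; assumes rnn_cell_backward(da_next, cache) from the
    # notebook is defined, returning a dict with keys
    # "dxt", "da_prev", "dWax", "dWaa", "dba".
    (caches_list, x) = caches
    (a1, a0, x1, parameters) = caches_list[0]

    n_a, m, T_x = da.shape
    n_x, m = x1.shape

    # Initialize accumulated gradients with zeros
    dx = np.zeros((n_x, m, T_x))
    dWax = np.zeros((n_a, n_x))
    dWaa = np.zeros((n_a, n_a))
    dba = np.zeros((n_a, 1))
    da_prevt = np.zeros((n_a, m))

    for t in reversed(range(T_x)):
        # The gradient entering the cell at step t is the SUM of the
        # gradient from the output layer at t (da[:, :, t]) and the
        # gradient flowing back from step t+1 (da_prevt). Omitting the
        # da[:, :, t] term is the mistake described above.
        gradients = rnn_cell_backward(da[:, :, t] + da_prevt, caches_list[t])
        dx[:, :, t] = gradients["dxt"]
        da_prevt = gradients["da_prev"]
        dWax += gradients["dWax"]
        dWaa += gradients["dWaa"]
        dba += gradients["dba"]

    da0 = da_prevt
    return {"dx": dx, "da0": da0, "dWax": dWax, "dWaa": dWaa, "dba": dba}
```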
I realize the backpropagation section is optional/advanced, and I know the addition is pointed out in the note for Figure 7. But given how many people have been confused by it, it might be worth both adding a comment about it to the skeleton code of rnn_backward and going into a bit more depth on this aspect of the backpropagation in the assignment itself.