Hi,
Thanks for taking the time to read my post. I have a question about how the gradient is updated during backpropagation in Week 1, Assignment 2. In 'utils.py', specifically in the function 'rnn_backward(X, Y, parameters, cache)', why do we subtract 1 when updating 'dy' during backpropagation? If the activation function is softmax, shouldn't it be dy = y*(1-y)?
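To make it concrete, here is roughly the step I mean (a paraphrased sketch, not the actual 'utils.py' code, so the function and variable names here are my own):

```python
import numpy as np

def output_gradient_sketch(y_hat, true_index):
    # y_hat: softmax output at one time step, shape (vocab_size, 1)
    # true_index: index of the correct character at that time step
    dy = np.copy(y_hat)
    dy[true_index] -= 1  # this is the "subtract 1" step I don't understand
    return dy

# Example: 4-way softmax output where the correct class is index 2
y_hat = np.array([[0.1], [0.2], [0.6], [0.1]])
print(output_gradient_sketch(y_hat, 2))
```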
Thank you very much in advance for any help.