Why do we subtract 1 in rnn backward provided in C5W1 assignment 2

ashok_v · April 29, 2023, 3:05am

START CODE HERE

# Backpropagate through time
for t in reversed(range(len(X))):
    dy = np.copy(y_hat[t])
    dy[Y[t]] -= 1
    gradients = rnn_step_backward(dy, gradients, parameters, x[t], a[t], a[t-1])

ashok_v · April 29, 2023, 3:17am

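Okay, I figured this out. The subtraction comes from the derivative of the softmax cross-entropy loss with respect to the pre-softmax output, which is (a - y), where a is the softmax output and y is the one-hot label. Since y is 1 only at index Y[t], subtracting y means subtracting 1 at that single position. There was no comment in the code, so it was difficult to guess this right away.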
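For anyone who wants to check the (a - y) claim, here is a minimal sketch that compares it against a finite-difference gradient. It is not the assignment's code: the helper names (`softmax`, `cross_entropy`, `analytic_grad`, `numeric_grad`) and the logit values are made up for illustration, and it uses plain Python lists instead of NumPy so it runs with no dependencies.

```python
import math

def softmax(z):
    """Numerically stable softmax over a list of logits."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy(z, target):
    """Loss -log(softmax(z)[target]) for one integer class label."""
    return -math.log(softmax(z)[target])

def analytic_grad(z, target):
    """Gradient of the loss w.r.t. the logits: y_hat - one_hot(target)."""
    dy = softmax(z)    # fresh list, plays the role of np.copy(y_hat[t])
    dy[target] -= 1.0  # same operation as dy[Y[t]] -= 1
    return dy

def numeric_grad(z, target, eps=1e-6):
    """Central finite differences, used only to check the analytic result."""
    grad = []
    for i in range(len(z)):
        z_plus = list(z)
        z_minus = list(z)
        z_plus[i] += eps
        z_minus[i] -= eps
        diff = cross_entropy(z_plus, target) - cross_entropy(z_minus, target)
        grad.append(diff / (2 * eps))
    return grad

# Hypothetical logits and label, chosen only for illustration.
z = [2.0, -1.0, 0.5]
target = 0

analytic = analytic_grad(z, target)
numeric = numeric_grad(z, target)
max_diff = max(abs(a - n) for a, n in zip(analytic, numeric))
print("analytic:", analytic)
print("numeric: ", numeric)
print("max abs difference:", max_diff)
```

The two gradients should agree up to finite-difference error, and the entries of the analytic gradient sum to roughly zero because the softmax outputs sum to 1 and exactly 1 is subtracted.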

