Hey there,
I think da0 (the one computed just before the gradients are returned) doesn't make any sense. Or does it have a special job?
rnn_backward()
lstm_backward()
Correct me if I am wrong.
Do you still have an open question here?