Hi all,
I want to understand the mechanisms of back-propagation in RNNs and LSTMs. I was working through the optional section of week 1's first assignment, but I would like to go deeper into the theoretical derivations for back-prop.
Specifically, I want to know:
- How da_next is computed.
- Why the following extra term is present in the back-prop derivative equations for the LSTM forget and input gates (see the formula I've copied below):
  $da_{next} * c_{prev}$ (in reference to the notebook formulae)
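For reference, this is the forget-gate formula as I understand it from the notebook (quoting from memory, so the exact notation may be slightly off); the part I'm asking about is the $c_{prev} * da_{next}$ piece of the second term inside the parentheses:

$$d\gamma_f^{\langle t \rangle} = \left( dc_{next} * c_{prev} + \Gamma_o^{\langle t \rangle} * \left(1 - \tanh^2(c_{next})\right) * c_{prev} * da_{next} \right) * \Gamma_f^{\langle t \rangle} * \left(1 - \Gamma_f^{\langle t \rangle}\right)$$

The update/input gate formula has the same structure, with the candidate $\tilde{c}^{\langle t \rangle}$ in place of $c_{prev}$.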
These are the LSTM equations:
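(Writing them out in the course's notation, as I understand them, for anyone who doesn't have the notebook open:)

$$\Gamma_f^{\langle t \rangle} = \sigma\!\left(W_f \left[a^{\langle t-1 \rangle}, x^{\langle t \rangle}\right] + b_f\right)$$
$$\Gamma_u^{\langle t \rangle} = \sigma\!\left(W_u \left[a^{\langle t-1 \rangle}, x^{\langle t \rangle}\right] + b_u\right)$$
$$\tilde{c}^{\langle t \rangle} = \tanh\!\left(W_c \left[a^{\langle t-1 \rangle}, x^{\langle t \rangle}\right] + b_c\right)$$
$$c^{\langle t \rangle} = \Gamma_f^{\langle t \rangle} * c^{\langle t-1 \rangle} + \Gamma_u^{\langle t \rangle} * \tilde{c}^{\langle t \rangle}$$
$$\Gamma_o^{\langle t \rangle} = \sigma\!\left(W_o \left[a^{\langle t-1 \rangle}, x^{\langle t \rangle}\right] + b_o\right)$$
$$a^{\langle t \rangle} = \Gamma_o^{\langle t \rangle} * \tanh\!\left(c^{\langle t \rangle}\right)$$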