Derivation of Backpropagation in RNNs

Can anybody help me with the derivation of these variables in backpropagation?

Hey there @mahesh-mantri,

I assume you want an explanation of each of them (a short derivation sketch follows the list):

  • dtanh → Gradient of the loss w.r.t. the input of the tanh activation, i.e. the incoming gradient da scaled elementwise by the derivative of tanh.

  • dW_{ax} → Gradient of the loss w.r.t. the weight matrix connecting the input to the current hidden state.

  • dW_{aa} → Gradient of the loss w.r.t. the weight matrix connecting the previous hidden state to the current hidden state.

  • db_a → The gradient of the loss w.r.t. the bias term in the hidden state update.

  • dx^{(t)} → The gradient of the loss w.r.t. the input at time step t.

  • da_{prev} → Gradient of the loss w.r.t. the previous hidden state.
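Since the actual question is about the math, here is a short derivation sketch. It assumes the standard vanilla RNN cell $a^{(t)} = \tanh\big(z^{(t)}\big)$ with pre-activation $z^{(t)} = W_{ax}\,x^{(t)} + W_{aa}\,a^{(t-1)} + b_a$, and writes $da^{(t)}$ for the gradient of the loss $\mathcal{L}$ flowing into the cell:

$$
\begin{aligned}
\text{dtanh} &= \frac{\partial \mathcal{L}}{\partial z^{(t)}} = da^{(t)} \odot \big(1 - (a^{(t)})^{2}\big) \quad \text{(since } \tanh'(z) = 1 - \tanh^{2}(z)\text{)}\\
dW_{ax} &= \frac{\partial \mathcal{L}}{\partial W_{ax}} = \text{dtanh}\,\big(x^{(t)}\big)^{\top}\\
dW_{aa} &= \frac{\partial \mathcal{L}}{\partial W_{aa}} = \text{dtanh}\,\big(a^{(t-1)}\big)^{\top}\\
db_{a} &= \frac{\partial \mathcal{L}}{\partial b_{a}} = \textstyle\sum_{\text{batch}} \text{dtanh}\\
dx^{(t)} &= \frac{\partial \mathcal{L}}{\partial x^{(t)}} = W_{ax}^{\top}\,\text{dtanh}\\
da_{prev} &= \frac{\partial \mathcal{L}}{\partial a^{(t-1)}} = W_{aa}^{\top}\,\text{dtanh}
\end{aligned}
$$

Every line is just the chain rule: $z^{(t)}$ is linear in each of $W_{ax}$, $W_{aa}$, $b_a$, $x^{(t)}$ and $a^{(t-1)}$, so once you have dtanh, each gradient is dtanh multiplied by (the transpose of) the remaining factor, and $db_a$ sums dtanh over the batch dimension because the bias is broadcast to every example.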

Feel free to ask if you need further assistance!

Hello @Alireza_Saei, I was asking how we can derive these formulae; I want to understand the math behind them.

These are some basic derivatives. You can find the formulas almost everywhere! Try to understand them yourself, but if you ever feel stuck, feel free to ask, and I can explain them!

This first requires that you understand calculus. If you do, then the derivatives follow directly from the chain rule.
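If seeing the formulas run helps, here is a minimal NumPy sketch (my own shapes, variable names, and toy loss, not the assignment's exact code) that implements the backward step for one cell and checks one entry of $dW_{ax}$ against a finite-difference estimate:

```python
import numpy as np

np.random.seed(0)

# Assumed toy sizes: n_a hidden units, n_x input features, m examples.
n_a, n_x, m = 5, 3, 4
Wax = np.random.randn(n_a, n_x)
Waa = np.random.randn(n_a, n_a)
ba  = np.random.randn(n_a, 1)
xt     = np.random.randn(n_x, m)
a_prev = np.random.randn(n_a, m)

def forward(Wax, Waa, ba, xt, a_prev):
    """One vanilla RNN cell: a_t = tanh(Wax xt + Waa a_prev + ba)."""
    return np.tanh(Wax @ xt + Waa @ a_prev + ba)

# Hypothetical loss: simply the sum of the hidden state, so dL/da_t = 1 everywhere.
a_t = forward(Wax, Waa, ba, xt, a_prev)
da  = np.ones_like(a_t)

# Backward pass, following the formulas discussed above.
dtanh   = da * (1 - a_t ** 2)               # dL/dz, using tanh'(z) = 1 - tanh(z)^2
dWax    = dtanh @ xt.T                      # gradient w.r.t. input-to-hidden weights
dWaa    = dtanh @ a_prev.T                  # gradient w.r.t. hidden-to-hidden weights
dba     = dtanh.sum(axis=1, keepdims=True)  # bias is broadcast over the batch, so sum
dxt     = Wax.T @ dtanh                     # gradient w.r.t. the input at time t
da_prev = Waa.T @ dtanh                     # gradient w.r.t. the previous hidden state

# Finite-difference check of one entry of dWax.
eps, i, j = 1e-6, 1, 2
Wax_plus, Wax_minus = Wax.copy(), Wax.copy()
Wax_plus[i, j]  += eps
Wax_minus[i, j] -= eps
numeric = (forward(Wax_plus, Waa, ba, xt, a_prev).sum()
           - forward(Wax_minus, Waa, ba, xt, a_prev).sum()) / (2 * eps)
print("analytic:", dWax[i, j], " numeric:", numeric)
```

If the analytic and numeric values agree to several decimal places, the derivation (and the implementation) is consistent.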