[Week 1] How are the weights updated in backpropagation through time?

Thank you, Paul, for your answer covering many SGD aspects and drawing the big picture.

If you apply them one at a time (without repeatedly recomputing the loss for earlier time steps), you will not change the loss function itself, only the coordinate at which you evaluate the gradients, right? E.g., if you immediately update the weights based on the gradient of $\mathcal{L}^{T_y}$, won't you then take the gradient of $\mathcal{L}^{T_y - 1}$ at a slightly different coordinate?
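To make my question concrete, here is a minimal sketch (my own toy example, not from the course) with a single scalar weight and hypothetical quadratic per-time-step losses $\mathcal{L}^{t}(w) = \tfrac{1}{2}(w - t)^2$. Accumulating all per-time-step gradients at the same coordinate before one update gives a different result than updating immediately after each gradient, because the second gradient in the sequential variant is evaluated at a shifted coordinate:

```python
import numpy as np

# Toy per-time-step loss L_t(w) = 0.5 * (w - t)^2, so dL_t/dw = w - t.
def grad(w, t):
    return w - t

w0, lr = 0.0, 0.1
timesteps = [2, 1]  # gradients taken from t = T_y down to t = T_y - 1

# Variant A: accumulate all gradients at the same coordinate w0, then update once.
g_sum = sum(grad(w0, t) for t in timesteps)
w_batch = w0 - lr * g_sum

# Variant B: update the weight immediately after each per-time-step gradient.
# The gradient for the earlier time step is now taken at a shifted coordinate.
w_seq = w0
for t in timesteps:
    w_seq -= lr * grad(w_seq, t)

print(w_batch, w_seq)  # 0.3 vs. 0.28 -- same losses, different final weights
```

The two variants disagree (0.3 vs. 0.28 here), which is exactly the discrepancy I am asking about: immediate updates change the coordinate where the remaining gradients are taken.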