Emojifier-V1 Gradient Descent Code

Emojifier-V1 has the following gradient descent code in the function model():

```python
# Compute gradients
dz = a - Y_oh[i]                                       # dL/dz, shape (n_y,)
dW += np.dot(dz.reshape(n_y, 1), avg.reshape(1, n_h))  # dL/dW, shape (n_y, n_h)
db += dz                                               # dL/db, shape (n_y,)
```

How are the gradients derived?

Thanks in advance,

The derivation of backpropagation requires calculus, which is beyond the scope of these courses. There was an optional section in the first assignment of C5 W1 (Building an RNN Step by Step) that shows the formulas for the general cases of backpropagation for an RNN or LSTM network. Please have a look at the general case there; perhaps it will “map” to what we are doing here.
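That said, for anyone who wants a concrete sanity check rather than the full derivation: if the forward pass is z = W·avg + b followed by a = softmax(z), with cross-entropy loss L = -Σ Y_oh[i]·log(a) (which is the setup in Emojifier-V1), then the standard softmax-plus-cross-entropy result gives dL/dz = a - Y_oh[i], and since z is linear in W and b, the chain rule gives dL/dW as the outer product of dz and avg, and dL/db = dz. Below is a minimal numpy sketch, not the assignment's code: the dimensions (n_h = 50 GloVe features, n_y = 5 classes) and the random inputs are assumptions for illustration, and the finite-difference check at the end is just a way to verify the formulas.

```python
import numpy as np

np.random.seed(0)

# Assumed dimensions: 50-d word vectors, 5 emoji classes (illustrative).
n_h, n_y = 50, 5

# One training example: avg stands in for the average of the sentence's
# word vectors; y_oh is a one-hot label (class 2 chosen arbitrarily).
avg = np.random.randn(n_h)
y_oh = np.eye(n_y)[2]

W = np.random.randn(n_y, n_h) * 0.01
b = np.zeros(n_y)

def softmax(z):
    e = np.exp(z - np.max(z))  # shift for numerical stability
    return e / e.sum()

def loss(W, b):
    a = softmax(np.dot(W, avg) + b)
    return -np.sum(y_oh * np.log(a))  # cross-entropy

# Analytic gradients, matching the lines quoted above:
a = softmax(np.dot(W, avg) + b)
dz = a - y_oh                                         # dL/dz
dW = np.dot(dz.reshape(n_y, 1), avg.reshape(1, n_h))  # outer product: dL/dW
db = dz                                               # dL/db

# Finite-difference check on one entry of W:
eps = 1e-6
Wp, Wm = W.copy(), W.copy()
Wp[0, 0] += eps
Wm[0, 0] -= eps
numerical = (loss(Wp, b) - loss(Wm, b)) / (2 * eps)
print(dW[0, 0], numerical)  # the two values should agree closely
```

The outer-product form of dW falls out because each z_k = Σ_j W[k, j]·avg[j] + b[k] is linear in the weights, so dL/dW[k, j] = dz[k]·avg[j]; the reshape calls in the quoted code are just building that outer product explicitly.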