Can anyone explain the equations for grad_b1 and grad_b2 from the lecture? What is meant by step(Z1) and the matrix 1_m?
Hi @Amit_Gairola1,
grad_b1 and grad_b2 are the gradients w.r.t. the biases, i.e. $\frac{\partial J_{batch}}{\partial b_1}$ and $\frac{\partial J_{batch}}{\partial b_2}$ respectively. The step function is needed for the backward propagation through the ReLU non-linearity: for every positive element of $Z_1$ it outputs one, and for all other elements it outputs zero. $1_m$ is a row vector containing $m$ elements, all equal to 1. As you can see on the slide, the result of $A \cdot 1^\top_m$ is equivalent to summing the elements of each row of the matrix $A$.
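Here is a minimal NumPy sketch of both ideas. The shapes and variable names (hidden size 3, batch size m = 4, dZ1 as the upstream gradient) are my own assumptions for illustration, not taken from the assignment:

```python
import numpy as np

m = 4                                # number of examples in the batch (assumed)
Z1 = np.random.randn(3, m)           # hidden-layer pre-activations, one column per example
dZ1 = np.random.randn(3, m)          # upstream gradient flowing back into Z1 (placeholder values)

# step(Z1): 1 where Z1 > 0, 0 elsewhere -- the elementwise derivative of ReLU.
step_Z1 = (Z1 > 0).astype(float)

# Backprop through ReLU: elementwise product with step(Z1).
grad_Z1 = dZ1 * step_Z1

# 1_m is a row vector of m ones; multiplying a matrix by its transpose sums each row.
ones_m = np.ones((1, m))
grad_b1_via_ones = grad_Z1 @ ones_m.T                      # shape (3, 1)
grad_b1_via_sum = grad_Z1.sum(axis=1, keepdims=True)       # same result

assert np.allclose(grad_b1_via_ones, grad_b1_via_sum)
```

The assertion checks the point made on the slide: multiplying by $1^\top_m$ is just a compact way of writing a row-wise sum over the batch, which is why the bias gradient ends up as a column vector.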