A general question on gradient descent wrt C1_W2 excercise


Why are we ignoring “w-” and “b-” ? Is it because it is specified in the question or we generally ignore it?
Mentors please help.
I have solved the question but I would like to know , whether its just because we are ignoring it to solve this specific problem( as per the intention of the question framer)??

Hey @John21,
We aren’t ignoring them. In the Exercise 2 compute_gradient function, we are only supposed to calculate the gradients, i.e., \frac{1}{m} \sum_{i=1}^M (f_{w, b}(x^{(i)}) - y^{(i)}) x^{(i)} and \frac{1}{m} \sum_{i=1}^M (f_{w, b}(x^{(i)}) - y^{(i)}). If you take a look at the gradient_descent function, you will find that we have used these gradients to update the parameters as per the update equations only. I hope this helps.

Regards,
Elemento

1 Like