In gradient descent, for both linear and logistic regression, the derivatives for w_j and b are different: there is an x_j[i] factor outside the brackets for w_j. So my question is: why do we have the x_j[i] multiplication for w_j but not for b?
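For reference, the two derivatives I mean (written out in the course's notation, as I understand it) are roughly:

dj_dw_j = (1/m) * sum over i of ( f_wb(x[i]) - y[i] ) * x_j[i]
dj_db   = (1/m) * sum over i of ( f_wb(x[i]) - y[i] )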
That’s simply what comes out when you do the calculus for the partial derivative of the cost with respect to b.
Intuitively, if you look at f_wb = w*x + b, you can see that ‘w’ is scaled by x, but ‘b’ is not. When you apply the chain rule, the partial derivative of f_wb with respect to w_j is x_j, while the partial derivative with respect to b is just 1, so the extra x_j[i] factor only survives in dj_dw. This is the basis for why dj_dw and dj_db have different forms.
Thanks for your reply. Intuitively, I had also thought about the absence of x in the b term, but after your reply I’m confident about it.