Understanding Cost function vs Gradient descent similarities

Durston · July 11, 2022, 10:14pm

Hello!

I’m struggling to come to terms here, pun intended. Specifically, J(w,b).

For the Cost function, it’s defined as one thing which includes being multiplied by (1/2m):

For Gradient Descent, it’s defined as another thing, which instead is multiplied by (1/m):

As I’m typing this out I’m realizing that the latter is NOT actually J(w,b) but rather dJ(w,b)/dw. Now I presume that multiplying J(w,b) and d/dw somehow yields the gradient descent formula from the cost function. Admittedly, I’m not entirely sure what I’m saying and greatly appreciate anyone taking the time to help me understand. I also realize understanding this may be out of the scope of my math knowledge and it’s just something I’ll have to accept.

TMosh · July 11, 2022, 10:32pm

It’s a calculus thing. The gradients are the partial derivative of the cost equation.

Topic		Replies	Views
Optional Lab: Gradient Descent1 Supervised ML: Regression and Classification week-1	4	513	April 28, 2023
What's the usage of J(w,b) for logistic regression? Supervised ML: Regression and Classification week-3	18	692	June 9, 2024
Why does derivatives for w_j in gradient descent differ from b? Supervised ML: Regression and Classification week-2	2	276	November 29, 2023
Gradient Descent Implementation Supervised ML: Regression and Classification week-3	6	714	March 19, 2023
I have problem understanding how to compute the gradient Supervised ML: Regression and Classification week-2	1	520	July 21, 2022

Understanding Cost function vs Gradient descent similarities

Related topics