J(w,b) cost function

Ian_Proffitt · February 13, 2022, 10:29pm

Hi mentors.
I am taking a big step back at the moment to revise and improve my understanding, as opposed to just obtaining results.
Can you clear up something I didn’t take the time to understand properly right at the start.
Given an input matrix X[n,m].
is there a J for every row [n] added across the [m] examples of x(i).
My logic says that J must be a [n,2] vector but I haven’t seen this defined anywhere.; I would expect there to be ‘n’ number of j (small case) sums. with j[0] = sum(dW) and j[1] = sum(db)
Regards
Ian

paulinpaloalto · February 13, 2022, 10:49pm

There is a loss value for each sample. That’s each column of X. That is a scalar value for each sample. Then we define the cost J as the average of the loss values across all m samples.

Ian_Proffitt · February 13, 2022, 10:55pm

Yes, I should have written ‘average’ I understood that.
The implication of what you say is that J (capital) is a single number at each iteration of the calculation?

paulinpaloalto · February 13, 2022, 10:55pm

To express it in math formulas, we have the loss first:

L(\hat{y}_i, y_i) = - y_i * log(\hat{y}_i) - (1 - y_i) * log(1 - \hat{y}_i)

Then the cost is defined as the average of the loss values across the samples:

J = \displaystyle \frac {1}{m} \sum_{i = 1}^{m} L(\hat{y}_i, y_i)

paulinpaloalto · February 13, 2022, 10:57pm

Yes, J is a scalar value at every iteration.

Ian_Proffitt · February 13, 2022, 11:00pm

I need to research that a little further - my matrix math is rusty.
I defer to you sensei
Ian

Ian_Proffitt · February 14, 2022, 1:14pm

Thanks Paul. I have got my head around that now. Understanding that explains a lot that I was unsure of further into the second course .
Regards
Ian

Topic		Replies	Views
Course2_week2_assignment Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	611	June 28, 2021
Course1 Week4 Cost / Lost at the end of Forward pass Neural Networks and Deep Learning coursera-platform	1	509	January 31, 2022
W4_A1_Inconsistent cost function notation in formula 8 and 9 Neural Networks and Deep Learning coursera-platform	3	527	January 18, 2023
Week 3 - Backpropagation Intuition - gradient descent Neural Networks and Deep Learning coursera-platform	1	498	July 18, 2022
How to compute J(w) in gradient checking Improving Deep Neural Networks: Hyperparameter tun coursera-platform	5	572	January 6, 2023

J(w,b) cost function

Related topics