Course 1 Week 4: Cost / Loss at the end of the forward pass

I tried to redo the lecture's forward/backward cycle, marking what matrix sizes I have everywhere based on a few assumptions (m=2, n[0]=3, n[1]=4, n[2]=3, n[3]=1).
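For reference, here is a quick sketch (my own, not from the notebook) of the shapes those assumptions give, following the course convention that W[l] has shape (n[l], n[l-1]) and A[l] has shape (n[l], m); the sigmoid is just a stand-in activation:

```python
import numpy as np

# Hypothetical sizes from my assumptions: m = 2 samples, layer sizes n = [3, 4, 3, 1]
m, n = 2, [3, 4, 3, 1]

A = np.random.randn(n[0], m)            # A[0] = X, shape (3, 2)
for l in range(1, len(n)):
    W = np.random.randn(n[l], n[l - 1])  # W[l], e.g. W[1] is (4, 3)
    b = np.zeros((n[l], 1))              # b[l], broadcast across the m columns
    A = 1 / (1 + np.exp(-(W @ A + b)))   # sigmoid as a stand-in activation
    print(f"A[{l}] shape:", A.shape)     # prints (4, 2), then (3, 2), then (1, 2)
```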

Everything makes sense except the loss, or rather the cost — that's where my problem is. I figured out what size dA[last] has to be to make it all work, but here is my question:

How do I calculate the loss? Do I give pairs of elements to my loss function and then put the results in a vector? Alright, that makes sense, but what about the cost? At the end of the day we want to minimize the COST, not the individual losses.

I don't know how to make my question any clearer; I just want to know what exactly is happening at the end of the forward pass. (My gut tells me the two are connected and it all makes sense, but my brain doesn't see it.)

My thought process as a gif: [attached image]

This was all covered in the lectures and is apparent in the formulas you show and in those shown in the notebook. The cost is the average of the loss values across all the samples: for each sample you get a scalar loss value, computed according to the cross-entropy loss formula, and you average those scalars to get the cost. Back prop then tries to minimize that cost value, which means it takes all the samples into account.
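In NumPy terms, a minimal sketch of that step (with made-up activations AL and labels Y, for a single output unit and m=2 as in your example) might look like this:

```python
import numpy as np

# Assumed example values: output layer of size 1, m = 2 samples
AL = np.array([[0.8, 0.4]])   # activations from the last layer, shape (1, m)
Y  = np.array([[1, 0]])       # true labels, shape (1, m)
m = Y.shape[1]

# Per-sample cross-entropy loss: one scalar per column (per sample)
losses = -(Y * np.log(AL) + (1 - Y) * np.log(1 - AL))   # shape (1, m)

# Cost = average of the per-sample losses -> a single scalar
cost = np.sum(losses) / m

print(losses)   # [[0.22314355 0.51082562]]
print(cost)     # 0.36698458...
```

So the forward pass ends with one scalar loss per sample, and the cost is just their mean; that single number is what back prop works to minimize.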