C2_W2_SoftMax Cost computation and partial derivatives

pritamdodeja · July 8, 2022, 4:33pm

Screenshot 2022-07-08 12.24.56 PM

Is the 1/m term missing here in the cost function? Also, how do the partial derivatives get computed for the cost function given that it is not in “one piece”? Before we used the 1-y trick to put it together. Does the same thing happen here with just more terms?

Thank you!

TMosh · July 9, 2022, 2:08am

Since this is the softmax calculation, there’s no 1/m term required.
This is simply re-scaling the output values from logistic regression, so they all sum to 1.

pritamdodeja · July 9, 2022, 2:58am

I thought the m term came from the samples, which is independent of the fact that it’s softmax right? That left most summation from 1 to m, I thought that was accumulating the losses, which would then get averaged out for the batch. I can understanding the a vector adding up to 1, but do not understand how the losses would add add up that way. Thank you!

rmwkwok · July 9, 2022, 3:16am

Hi @pritamdodeja ,

The 1/m term is not needed for the softmax of 1 sample. But I think we need the 1/m for the cost of all samples. I will share this with the course team.

We don’t use the 1-y trick here because there are more than 2 classes assumed, instead we use the indicator function \mathbb{1} as defined in the first line of your screenshot.

Raymond

Topic		Replies	Views
Vectorizing Logistic Regression's Gradient Output - why no 1/m? Neural Networks and Deep Learning coursera-platform	2	408	July 18, 2023
Why was cost function for Logistic reg 1/m and not 1/2m? Supervised ML: Regression and Classification week-module-3	5	39	September 23, 2024
Calculation of partial derivative of the cost function for logistic regression Supervised ML: Regression and Classification week-module-3	60	176	February 25, 2025
Cost function of Softmax function Advanced Learning Algorithms week-module-2	2	250	May 26, 2024
Possible typo (missing 1/m) Neural Networks and Deep Learning coursera-platform	3	595	August 21, 2022

C2_W2_SoftMax Cost computation and partial derivatives

Related topics