Cost function of Softmax function

Kavalanche · May 26, 2024, 10:51am

In this equation, I cannot understand why it takes the logarithm of only one specific class. For example, suppose our softmax output is [0.1, 0.7, 0.2] and the true class of this example is 1. Why do we take log(0.1) (assuming indexing starts from 1) and ignore 0.7 and 0.2 when calculating the loss, so that they aren't included in the cost function either? I know I explained this badly; I will try to improve.

TMosh · May 26, 2024, 4:50pm

The loss is computed on each example separately, then all of those loss values are summed.
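To make that concrete, here is a minimal Python sketch. The function names, the example batch, and the 0-based class indices are assumptions for illustration and are not taken from the course code. Many formulations also divide the sum by the number of examples to get an average.

```python
import math

def per_example_loss(probs, target):
    """Loss for one example: -log of the probability given to the true class.

    probs  : list of softmax outputs for this example (should sum to 1)
    target : 0-based index of the true class (hypothetical convention)
    """
    return -math.log(probs[target])

def total_cost(batch_probs, targets):
    """Compute the loss for each example separately, then sum them."""
    losses = [per_example_loss(p, t) for p, t in zip(batch_probs, targets)]
    return losses, sum(losses)

# Hypothetical softmax outputs for three examples.
batch_probs = [
    [0.1, 0.7, 0.2],  # true class is the first one: loss = -log(0.1)
    [0.8, 0.1, 0.1],  # true class is the first one: loss = -log(0.8)
    [0.3, 0.3, 0.4],  # true class is the third one: loss = -log(0.4)
]
targets = [0, 0, 2]

losses, total = total_cost(batch_probs, targets)
print(losses, total)
```

The first example is the one from the question: a low probability (0.1) on the true class produces a much larger loss than the 0.8 in the second example.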

XinghaoZong · May 26, 2024, 6:09pm

Hi Kavalanche!

In the screenshot, it’s mentioned that only the line corresponding to the target class contributes to the loss. This means that when calculating the cost (or loss), we consider only the output value associated with the correct category.

For example, suppose the model’s output is [0.1, 0.7, 0.2] for three different categories. These values represent the model’s estimated probability for each category. If the target category is the first one, the output probability for that category is 0.1. Therefore, we include only the output value for the first category when computing the loss, which gives -log(0.1).
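One way to see why the other two values drop out is to write the loss with an indicator for the true class. The screenshot is not reproduced here, so the symbols below (a_j for the softmax outputs, y for the true class, N for the number of classes) are assumed notation and log is the natural logarithm:

```latex
L(\mathbf{a}, y) = -\sum_{j=1}^{N} \mathbf{1}\{y = j\}\,\log a_j = -\log a_y

% Worked example with a = [0.1, 0.7, 0.2] and y = 1:
L = -\bigl(1 \cdot \log 0.1 + 0 \cdot \log 0.7 + 0 \cdot \log 0.2\bigr)
  = -\log 0.1 \approx 2.30
```

The terms for 0.7 and 0.2 are still in the sum, but they are multiplied by zero because those classes are not the true class.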

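This approach ensures that the loss reflects how well the model performs specifically for the correct class, rather than directly including terms for the other categories. The other outputs still matter indirectly: softmax outputs sum to 1, so putting 0.7 and 0.2 on the wrong categories is exactly what leaves only 0.1 for the correct one.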
