Loss function for softmax regression when a = 0

francesco4203 · November 30, 2023, 2:07pm

Hi everyone,
In the loss function for softmax regression, if ai approaches 0, -log(ai) approaches -infinity, then how can a model handle this value?
Thanks.

rmwkwok · November 30, 2023, 2:53pm

Hello @francesco4203,

First, this thing → cannot be equal to zero. The best you can get is for it to tend to zero as z_1 tends to negative infinity.

However, in a computer, it is possible because it is not infinitely precise, and in that case, one usual trick is to add a very small number to make this → log ( 1e-7 + a) such that even if a_i numerically becomes 0, the 1e-7 will take care of that. Below is a relevant post on how Tensorflow uses a small number called epsilon to handle that.

Cheers,
Raymond

francesco4203 · December 1, 2023, 9:21am

Clear explanation!
Thank you.

Topic		Replies	Views
Week 2 lesson : softmax Advanced Learning Algorithms week-2	2	220	March 3, 2024
C1_W3_Logistic_Regression_Potential problem Supervised ML: Regression and Classification week-3	4	656	July 18, 2022
Error (video: Softmax) in the graph of the loss function Advanced Learning Algorithms week-2	1	500	August 15, 2022
Got "inf"/"nan" when use Tensorflow to optimize self defined loss AI Discussions	2	53	January 15, 2023
Cost function for logisitic regression : Has Andrew made a mistake here? Supervised ML: Regression and Classification week-3	8	81	February 12, 2025

Loss function for softmax regression when a = 0

Related topics