Ok, I understand the part where we take the exponent of the matrix and then divide every element by the sum. My question is: we could have done the divide step without taking the exponent, right? Why do we need to take exponents before dividing?
Hi @Kamal_Nayan, thank you for your question.
If we do not take exponents before dividing in the softmax function, the resulting values would not represent a valid probability distribution over the classes. Logits can be negative, so simply dividing them by their sum can produce negative "probabilities" (and even a division by zero if the logits cancel out), which means the softmax function would fail at its primary purpose of converting arbitrary real values into probabilities. The small sketch below illustrates this.
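Here is a quick numeric sketch (the logit values are made up purely for illustration) comparing a naive division by the raw sum with softmax:

```python
import numpy as np

# Made-up logits for a 3-class example (illustration only)
logits = np.array([2.0, -1.0, 0.5])

# Naive "normalization": divide by the raw sum of the logits
naive = logits / logits.sum()
print(naive)   # [ 1.333 -0.667  0.333] -- sums to 1, but contains a negative "probability"

# Softmax: exponentiate first, then normalize
exp_logits = np.exp(logits)
probs = exp_logits / exp_logits.sum()
print(probs)   # [0.786 0.039 0.175] -- all positive and sums to 1
```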
And can you explain how exactly the exponent helps us convert those numbers into probabilities?
Can you give me a statistical view of this?
e^{x} satisfies the mathematical properties needed to convert a logit into a probability. These properties are:
- Positivity: e^{x} always returns a positive value, and probabilities cannot be negative.
- Non-linearity: e^{x} is highly non-linear; as x increases, differences between logits are amplified, which makes the model more confident in its predictions.
- Normalization: when each e^{x_i} is divided by the sum of the exponentials over all classes, the resulting values sum up to 1.
Positivity and normalization are exactly what the Kolmogorov axioms of probability require of a distribution, and the non-linearity is what makes the exponential a useful choice among the functions that satisfy them. A minimal code sketch below puts the pieces together.
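If it helps, here is a minimal NumPy sketch (the function name and test values are just illustrative assumptions, not code from the course):

```python
import numpy as np

def softmax(logits):
    """Minimal softmax sketch: exponentiate, then normalize."""
    shifted = logits - np.max(logits)   # common numerical-stability trick; does not change the result
    exp_vals = np.exp(shifted)          # positivity: every entry is > 0
    return exp_vals / exp_vals.sum()    # normalization: entries sum to 1

probs = softmax(np.array([2.0, -1.0, 0.5]))
print(probs, probs.sum())               # all positive, sums to 1.0
```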
Thanks for this