Week2 softmax function

flyunicorn · May 17, 2025, 7:56am

I understand the usage of softmax function is to turn the final output into probablity for each class. But why we don’t use a1=z1/(z1+z2+z3+z4) which can also make the output between 0 and 1? Why we we use e to the power of z instead?

Alireza_Saei · May 17, 2025, 8:57am

Hi @flyunicorn

Simply using a1= \frac{z_1}{z_1 + z_2 + z_3 + z_4} only works if all z values are positive—if any z_i is negative, the result can be undefined or misleading. The softmax function uses e^{z_i} to keep all values are positive, emphasize larger scores exponentially, and keep outputs between 0 and 1 while their sum is 1.

Hope it helps! Feel free to ask if you need further assistance.

Topic		Replies	Views
Softmax formula Advanced Learning Algorithms week-module-2	1	494	March 14, 2023
Where does this e^z come from while doing softmax? Advanced Learning Algorithms week-module-2	5	451	July 9, 2023
Week 2 - Reason of using exponentials from z to a in Softmax Advanced Learning Algorithms week-module-2	1	142	May 17, 2024
Confusion about the mathematical formula of a1,a2,a3,a4 in softmax regression Advanced Learning Algorithms week-module-2	4	330	October 25, 2023
Model Output with and without Softmax Activation / from_logits=True Advanced Learning Algorithms week-module-2	11	491	June 1, 2023

Week2 softmax function

Related topics