Softmax activation function

Syamantak_Paliwal · March 4, 2023, 5:44am

Hi Sir,
“Unlike other activation functions, the softmax works across all the outputs.”
What does this statement mean?

TMosh · March 4, 2023, 5:58am

It means that all of the outputs are scaled so that their sum is exactly 1 for each example.

subagopa · March 4, 2023, 8:02am

Hi @Syamantak_Paliwal ! Su here!
To put your question in simple terms:
Softmax → Gives Probability of each possible class for output of a neural network
Other functions (eg. sigmoid/ReLu) → Only gives a number for each output

Topic		Replies	Views
Softmax layer Clarification Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	560	August 7, 2021
Softmax layer intuition Improving Deep Neural Networks: Hyperparameter tun coursera-platform	1	556	August 6, 2021
Why use Softmax instead of a linear transform that sums to 1? Neural Networks and Deep Learning coursera-platform	5	938	May 28, 2021
DL SP: Course: 1: Week2, practice_python_with_numpy, Exercise 7 - softmax Neural Networks and Deep Learning coursera-platform	1	537	September 11, 2021
Softmax layer at last layer Neural Networks and Deep Learning coursera-platform	1	543	April 15, 2022

Softmax activation function

Related topics