Activation function of SoftMax after optimization

vinay6 · July 23, 2023, 1:23am

Hi,

In the improved implementation for Softmax lecture, It shows the activation function is been changed from softmax to linear. How does this account for non-linearity which we introduce through softmax to get multiple classes?

rmwkwok · July 23, 2023, 1:26am

All those non-linearity is transferred to the loss function. You don’t just change the output layer’s activation to linear, you also change the configuration of the loss function, and that change in the loss function compensates for the change in the output layer’s activation.

I will only assure you that the non-linearity is always maintained. However, as for how the loss function does that, you need to examine the Tensorflow code yourself.

Raymond

vinay6 · July 23, 2023, 1:41am

By Changing configuration of Loss Function , do you mean the below line on code :

loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True) ?

Can you explain this configuration change as I was not understanding it clearly from the lecture

TMosh · July 23, 2023, 1:42am

Yes, it is the from logits.

vinay6 · July 23, 2023, 12:29pm

Hi,

So when from logits becomes true, is this loss function a linear function and the formula is like the linear cost function (mean squared one) or the Softmax loss optimised loss function which is the log formula one? If it is linear how do we account for the softmax concept?

Also, please explain or share resources in regards with this particular concept where we change it to linear but it is accounting for softmax later and the mathematics behind

TMosh · July 23, 2023, 10:49pm

Everything you would do with an activation and softmax is done automatically when you use from logits true.

Topic		Replies	Views
Softmax implementation Advanced Learning Algorithms week-module-2	6	557	May 11, 2023
Improved implementation of softmax - Neural network training \| Coursera Advanced Learning Algorithms week-module-2	1	73	June 25, 2024
C2_W2_SoftMax lab Advanced Learning Algorithms week-module-2	5	252	March 20, 2024
Improved implementation of softmax regression Advanced Learning Algorithms week-module-2	3	44	July 28, 2024
Numerical correct implementation of softmax Advanced Learning Algorithms week-module-2	6	626	December 24, 2022

Activation function of SoftMax after optimization

Related topics