Hello @Khalid_A.W,
We don’t just prefer to use a linear activation instead of sigmoid. We prefer to use a linear activation AND to set `from_logits=True` in the loss function that’s passed to the model for training. If you follow both steps, you will see that sigmoid is never out of the game.

Again, the sigmoid is there if we set `from_logits=True`. It is NOT in the output layer, because we use `linear` as that layer’s activation; however, it IS applied inside the loss function when we set `from_logits=True`.
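Here is a minimal sketch of the idea in TensorFlow/Keras (the layer sizes and the toy data are placeholders, just for illustration):

```python
import numpy as np
import tensorflow as tf

# Toy binary-classification data (placeholder values only).
X = np.random.rand(100, 4).astype("float32")
y = np.random.randint(0, 2, size=(100, 1)).astype("float32")

model = tf.keras.Sequential([
    tf.keras.layers.Dense(8, activation="relu"),
    # Output layer uses a LINEAR activation, so it emits raw logits.
    tf.keras.layers.Dense(1, activation="linear"),
])

# from_logits=True tells the loss to apply the sigmoid itself,
# in a numerically stable way, before computing the cross-entropy.
model.compile(
    optimizer="adam",
    loss=tf.keras.losses.BinaryCrossentropy(from_logits=True),
)
model.fit(X, y, epochs=5, verbose=0)

# At prediction time the model outputs logits, so we apply the
# sigmoid ourselves to recover probabilities.
probs = tf.math.sigmoid(model(X))
```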
Do the experiment yourself!
Cheers,
Raymond