Question about is_logit

Hello Eakanath @eix_rap,

Here are the steps.

I want you to look at how we have avoided computing e^{-z} when z is so large and negative that e^{-z} would overflow the computed result. We end up with e^{-z} because we use sigmoid for binary classification, or with e^{z} because we use softmax for multi-class classification. Note that the trick depends on which activation is used, which is why I felt uneasy when you said “any activation”.
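To make the idea concrete, here is a minimal sketch (my own illustration with NumPy, not the exact code from the course, and the function name `stable_sigmoid_bce` is mine) of the usual rearrangement for sigmoid cross-entropy computed from the logit: it only ever exponentiates a non-positive number, so e^{-z} can never overflow.

```python
import numpy as np

def stable_sigmoid_bce(z, y):
    """Binary cross-entropy computed directly from the logit z.

    Naively, loss = -y*log(sigmoid(z)) - (1-y)*log(1-sigmoid(z)),
    and sigmoid(z) = 1 / (1 + e^{-z}) needs e^{-z}, which overflows
    when z is a large negative number. The algebraically equivalent
    form below exponentiates only -|z| <= 0, so it stays finite.
    """
    return np.maximum(z, 0) - z * y + np.log1p(np.exp(-np.abs(z)))

# A very negative logit that would overflow the naive formula:
z = np.array([-1000.0, 0.0, 1000.0])
y = np.array([1.0, 1.0, 1.0])
print(stable_sigmoid_bce(z, y))  # finite values, no overflow warning
```

The same kind of rearrangement exists for softmax (subtracting the maximum logit before exponentiating), but the exact form is different, which is the point about the choice of activation mattering.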

Of course, I might be overreacting, because you may have been thinking only about sigmoid and softmax all along, but please don’t mind me making it a bit clearer :wink:

Cheers,
Raymond