Why is sigmoid used instead of softmax?

I noticed that sigmoid is used in the last activation layer of the assignment to classify the output. However, in C3_W3_Lab_2_OxfordPets-UNet, softmax is used. Why is that? Both the lab and the assignment do the same segmentation task, so why aren't we using softmax in C3_W3_Assignment?


I'm not sure, but I think softmax is the more general solution, and the only option if you have more than two classes. But if the segmentation problem has only two classes, it is also possible to use a sigmoid activation.
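To see why the two are interchangeable in the two-class case, here is a small stdlib-only sketch (the logit values are made up for illustration): a sigmoid over the difference of two logits gives the same class-1 probability as a two-way softmax over those logits.

```python
import math

def sigmoid(z):
    # Standard logistic function.
    return 1.0 / (1.0 + math.exp(-z))

def softmax(zs):
    # Numerically naive softmax; fine for small example values.
    exps = [math.exp(z) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]

# With two classes, softmax over logits [z0, z1] assigns class 1 the same
# probability as sigmoid applied to the logit difference z1 - z0.
z0, z1 = 0.3, 1.7
p_softmax = softmax([z0, z1])[1]
p_sigmoid = sigmoid(z1 - z0)
print(p_softmax, p_sigmoid)  # both approximately 0.802
```

So for binary segmentation a single sigmoid channel carries exactly the same information as a two-channel softmax; the choice is a matter of convention (and of which loss function you pair it with).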

Sometimes it is just personal preference; for me, it is better to always use a softmax in a segmentation decoder.

I encourage you to try replacing the last layer in the assignment 🙂 and if you do, please share the results with us!

Regards.


I agree with @Pere_Martra, and superficially, with only 2 classes there shouldn't be any difference between sigmoid and softmax. But one subtle point is that these activations are meant to be paired with different loss functions: sparse_categorical_crossentropy for softmax vs. binary_crossentropy for sigmoid. I am guessing this might play a role here, but I would have to dig deeper into these assignments before confirming it.
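The loss pairing lines up the same way the activations do. A stdlib-only sketch (the logit value and label are made up for illustration): binary cross-entropy on a sigmoid output equals sparse categorical cross-entropy on the equivalent two-way softmax, because both reduce to -log of the probability assigned to the true class.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def softmax(zs):
    exps = [math.exp(z) for z in zs]
    total = sum(exps)
    return [e / total for e in exps]

def binary_crossentropy(y_true, p):
    # y_true is 0 or 1; p is the sigmoid probability of class 1.
    return -(y_true * math.log(p) + (1 - y_true) * math.log(1 - p))

def sparse_categorical_crossentropy(y_true, probs):
    # y_true is an integer class index; probs is a softmax distribution.
    return -math.log(probs[y_true])

# A single logit z for a sigmoid head corresponds to logits [0, z] for a
# two-way softmax head, since sigmoid(z) == softmax([0, z])[1].
z = 0.9
y = 1
bce = binary_crossentropy(y, sigmoid(z))
scce = sparse_categorical_crossentropy(y, softmax([0.0, z]))
```

Here `bce` and `scce` come out identical, which is why the sigmoid + binary_crossentropy pairing in the assignment and the softmax + sparse_categorical_crossentropy pairing in the lab should train toward the same solution in the two-class case.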