In the transfer learning exercise (week 2), when adding the Dense layers, why don't we use a sigmoid activation even though we are dealing with a binary classification problem?
That is because the TF/Keras loss functions all support a mode in which we feed the linear activation output (the "logits") to the loss function and let it compute the activation and the loss together. This is enabled with the from_logits = True parameter. The reason for using the loss functions this way is that it is a) more efficient (one fewer call) and b) more numerically stable (it is easier to deal with saturated sigmoid values, for example). You'll see that we always use this mode in both the binary and the categorical cases.
Please have a look at the documentation for the loss function if what I said above is not enough to fully explain this.
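Here is a minimal sketch of what that looks like (the input shape and layer sizes below are made up for illustration, not taken from the course notebook):

```python
import tensorflow as tf

# A minimal sketch, not the exact course model: the input shape (128,)
# and the layer size 64 are illustrative placeholders.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(128,)),
    tf.keras.layers.Dense(64, activation="relu"),
    tf.keras.layers.Dense(1),  # no activation: outputs a raw logit
])

# from_logits=True tells the loss to apply the sigmoid internally,
# fusing the activation and the cross-entropy into one numerically
# stable step.
model.compile(
    optimizer="adam",
    loss=tf.keras.losses.BinaryCrossentropy(from_logits=True),
    metrics=["accuracy"],
)

# Note that the model now outputs logits, so at prediction time apply
# the sigmoid yourself if you need probabilities:
#   probs = tf.sigmoid(model(x))
```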