I noticed that for the Transfer Learning assignment we are not using a Sigmoid activation for the last layer. Is this because the network has many non-linear layers, so the ability to learn a complex function is not compromised?
I was expecting a Sigmoid activation in the last layer since it's a binary classification problem. I realize that we are using the binary cross entropy loss function, so it will still encourage the output neuron to produce a result in [0, 1].
Can you explain briefly under what condition using a linear activation at the last layer is worse than a Sigmoid? Btw, I am aware of Sigmoid’s saturation problem.
It’s a little more subtle than that: we actually are using a sigmoid activation at the output layer, but it is applied as part of the loss function. Note that we pass the from_logits = True argument to the cross entropy loss function, which tells it that the inputs are raw logits and that it should apply the sigmoid internally.
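To see why folding the sigmoid into the loss helps, here is a minimal sketch in plain Python (not the assignment's TensorFlow code) comparing the naive "sigmoid, then log" computation with the numerically stable logits form that from_logits = True uses internally. The function names are my own for illustration.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def bce_naive(y, z):
    # Apply sigmoid first, then cross entropy.
    # log(p) underflows/overflows for large |z|.
    p = sigmoid(z)
    return -(y * math.log(p) + (1 - y) * math.log(1 - p))

def bce_from_logits(y, z):
    # Algebraically the same loss, rearranged so no
    # intermediate value ever overflows or underflows.
    return max(z, 0.0) - z * y + math.log1p(math.exp(-abs(z)))

# For moderate logits the two agree:
print(bce_naive(1.0, 2.0), bce_from_logits(1.0, 2.0))

# For an extreme logit the naive version breaks (exp(800) overflows),
# while the stable version returns the correct finite loss:
print(bce_from_logits(1.0, -800.0))
```

The stable form is why the course builds models with a linear output layer and lets the loss handle the sigmoid, rather than the sigmoid being optional in principle.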
Here’s a thread which explains why we do it that way. This pattern is used consistently throughout the courses; we first encountered it back in DLS C2 W3 in the TensorFlow Introduction assignment, which is what that thread is discussing.
Then the additional point is that when you want to use the trained model to make a prediction, you either manually add the sigmoid, or change the interpretation of the raw output so that a logit > 0 means True.
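Both prediction options give identical labels, because sigmoid(z) > 0.5 exactly when z > 0. A small sketch (plain Python, hypothetical function names):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Option 1: manually apply the sigmoid, then threshold at 0.5
def predict_with_sigmoid(logit):
    return sigmoid(logit) > 0.5

# Option 2: reinterpret the raw logit, thresholding at 0
def predict_from_logit(logit):
    return logit > 0

# The two decision rules agree for every logit:
for z in [-3.2, -0.1, 0.0, 0.7, 5.0]:
    print(z, predict_with_sigmoid(z), predict_from_logit(z))
```

Option 2 is cheaper and avoids any floating-point concerns from the sigmoid; Option 1 is useful when you also want the probability itself, not just the label.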
Ah yes, I remember now. Thanks for the quick reply, I greatly appreciate it!