Assignment 2 - Dense Layer Activation

It’s a little more subtle than that: we actually do use a sigmoid activation at the output layer, but it is applied as part of the loss function. Note that we pass the `from_logits=True` argument to the cross-entropy loss function.

Here’s a thread which explains why we do it that way. This pattern is used consistently throughout the courses; we first encountered it back in DLS C2 W3 in the TensorFlow Introduction assignment, which is what that thread is discussing.

The additional point is that when you want to use the trained model to make a prediction, you either manually apply the sigmoid to the output, or change the interpretation of the raw output: a logit > 0 means True, since sigmoid(z) > 0.5 exactly when z > 0.
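Here is a minimal numpy sketch of both points (this is an illustration, not the actual TensorFlow internals): the stable from-logits form of binary cross-entropy that `from_logits=True` computes, and the equivalence between thresholding sigmoid(z) at 0.5 and thresholding the raw logit z at 0. The function names are mine, for illustration only.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def bce_from_logits(z, y):
    # Numerically stable binary cross-entropy computed directly from
    # logits (the standard identity used when from_logits=True):
    #   max(z, 0) - z*y + log(1 + exp(-|z|))
    # This avoids computing sigmoid(z) first, which saturates to
    # exactly 0 or 1 for large |z| and would make log() blow up.
    return np.maximum(z, 0) - z * y + np.log1p(np.exp(-np.abs(z)))

logits = np.array([-30.0, -1.0, 0.5, 30.0])
labels = np.array([0.0, 0.0, 1.0, 1.0])

# Finite, well-behaved loss even where sigmoid saturates:
print(bce_from_logits(logits, labels))

# Prediction time: applying sigmoid and thresholding at 0.5 gives
# exactly the same answer as thresholding the raw logit at 0.
preds_via_sigmoid = sigmoid(logits) > 0.5
preds_via_logits = logits > 0
print(np.array_equal(preds_via_sigmoid, preds_via_logits))  # True
```

So either interpretation works at prediction time; skipping the sigmoid is just cheaper, while applying it gives you an actual probability rather than only a True/False decision.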