Why a linear activation function in the last layer? - C4W2

Hello.
For a binary classification problem it is logical (and also consistent with the previous courses) to use a sigmoid activation function in the output layer. But the programming assignment "Transfer Learning with MobileNet" uses a linear activation function. I tried different activation functions - sigmoid, relu - and the network only learned with the linear one. Could somebody explain why? What is the intuition?

They are still using a sigmoid, but they choose the from_logits = True mode of the binary cross-entropy loss function, which computes the sigmoid and the loss together. It is both more efficient and more numerically stable to do it that way. Please read the documentation of the TF binary cross-entropy loss for more information.
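
Here is a minimal sketch of what that pattern looks like in Keras. This is illustrative, not the exact assignment code - the image size and layer choices are assumptions:

```python
import tensorflow as tf

# Sketch: the final Dense layer has no activation argument, so it is
# linear and outputs a raw logit rather than a probability.
model = tf.keras.Sequential([
    tf.keras.applications.MobileNetV2(input_shape=(160, 160, 3),  # assumed size
                                      include_top=False,
                                      weights='imagenet'),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(1),  # linear output -> logit
])

# from_logits=True tells the loss to apply the sigmoid internally as part
# of computing the cross-entropy, which is more numerically stable than
# applying a separate sigmoid layer and then taking log() of its output.
model.compile(optimizer='adam',
              loss=tf.keras.losses.BinaryCrossentropy(from_logits=True),
              metrics=['accuracy'])
```

If you later want actual probabilities (e.g., at prediction time), you can apply tf.math.sigmoid to the model's output yourself.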

If you look back over past assignments since TF was first introduced in Course 2 Week 3, you'll see that this is always the way Prof Ng does it.
