Alcapa_model: why not sigmoid at prediction?

vuqpham · April 23, 2021, 2:21pm

Assignment Transfer_learning_with_MobileNet_v1:

For binary classification, we use the sigmoid (or softmax for 3 or more outputs). Why do we use the linear activation in the prediction layer of the model ?

Thanks,
Vu

paulinpaloalto · April 23, 2021, 2:36pm

The sigmoid is there: it’s just hidden in the loss function call that they use there. Here’s another recent thread about the same question.

paulinpaloalto · April 23, 2021, 2:39pm

Interesting. The search engine in Discourse seems to work pretty well. I was able to find that thread by searching for “alpaca”. Of course BinaryCrossentropy also works as more specific search term, that you have to already know the answer to the question in order for that to be useful .

vuqpham · April 23, 2021, 2:45pm

You are right. I should have used the Search function.

paulinpaloalto · April 24, 2021, 12:15am

It’s fine either way, but it can save you time if you can just find a pre-existing answer. You never know how long it’s going to take someone to respond to your question. Of course these courses are still pretty new, so there’s no guarantee that any given question has been asked before. It just turns out that this one had been …

lalkrishna · April 25, 2021, 7:47am

For binary classification, we can use both softmax and sigmoid but recommended one is sigmoid.

There is a good explanation by Jeremy.
Just read the below notebook from section ‘Softmax’ onwards.

fastbook/05_pet_breeds.ipynb at master · fastai/fastbook (github.com)

Topic		Replies	Views
Transfer_learning_with_MobileNet: Convolutional Neural Networks	1	498	November 22, 2021
Exercise 2 - alpaca_model (linear) Convolutional Neural Networks	2	596	August 16, 2023
Week2- programming assignment- mobile net- activation funcation Convolutional Neural Networks	2	541	November 3, 2021
[Week 2] Assignment 2, Exercise 2 : Why should we choose 'linear' output instead of sigmoid output if it's binary classification problem and not linear regression? Convolutional Neural Networks	1	759	April 19, 2021
C4 W2 Exercise 2 Convolutional Neural Networks	1	499	October 3, 2022

Alcapa_model: why not sigmoid at prediction?

Related topics