Week 2, prog_assgn, Ex-2

Aroonima · October 22, 2021, 2:36pm

Hi, Could anyone explain why the output layer is activated with ‘linear’ function instead of ‘sigmoid’ function.

paulinpaloalto · October 23, 2021, 8:29pm

Because the sigmoid activation is performed as part of the loss calculation. That’s what the from_logits = True argument tells the cost function. Doing it that way is more numerically stable and more efficient (one less call).

Aroonima · October 24, 2021, 12:38pm

But we expect the output to be a binary class and not a logit number, since its an image classification?

paulinpaloalto · October 24, 2021, 3:32pm

The method shown with the loss function is what you do when you are training the network. Either sigmoid or softmax is being applied as part of the loss in training mode, but the actual predictions (the real outputs, not the logits) are also saved and can be accessed. What we are using here is the Keras Model class, which has lots of methods to support the various things you need to do. Model.fit() does the training, but you use Model.predict() when you want the activation outputs. Here’s one of the relevant doc pages.

Aroonima · October 25, 2021, 7:00am

I tried to generate the prediction for a sample data. However, with .predict() method the result were logits and not classes. What needs to be specified in the .predict() as arguments to get classes?

Aroonima · October 25, 2021, 9:22am

I guess, after applying the .predict() method, to get the predictions in classes you have to apply the sigmoid function with threshold. I hope this is the way to go about it.

Topic		Replies	Views
Week 2 Assignment 2 alpaca_model new Convolutional Neural Networks coursera-platform	3	518	June 7, 2022
Exercise 2 - alpaca_model (linear) Convolutional Neural Networks coursera-platform	2	600	August 16, 2023
C4W2 activation in output layer Convolutional Neural Networks coursera-platform	1	515	August 19, 2021
Error in Lab notebook for Test Validation Convolutional Neural Networks coursera-platform	11	428	December 21, 2023
[Week 2] Assignment 2, Exercise 2 : Why should we choose 'linear' output instead of sigmoid output if it's binary classification problem and not linear regression? Convolutional Neural Networks coursera-platform	1	761	April 19, 2021

Week 2, prog_assgn, Ex-2

Related topics