Programming Assignment: Softmax activation is not applied

Anil_Purohit · January 2, 2022, 7:21am

Hi,

For the week 3 programming assignment, why are we not applying the softmax activation function on the output layer in the model method for every minibatch? This is the code that confuses me. Why are we computing the loss before applying softmax to Z3?

# 1. predict
Z3 = forward_propagation(tf.transpose(minibatch_X), parameters)
# 2. loss
minibatch_cost = compute_cost(Z3, tf.transpose(minibatch_Y))

paulinpaloalto · January 2, 2022, 7:42pm

You’re right that forward propagation does not apply the activation on the output layer. This is a very standard way to do things: the sigmoid or softmax output activation is computed as part of the cost computation. The from_logits argument tells the cost function to do that; see the documentation for the loss function, which is linked in the instructions for that portion of the assignment. The reason is that it is both more efficient and more numerically stable to do it that way: for example, the loss function can handle the “saturation” cases, where the activation output rounds to exactly 0 or 1, in an efficient way, among other things. You’ll find that Prof Ng uses this method in all cases in which we are using TF to implement and train models. It’s less code and it works better, so what is not to like about that?
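To make the stability point concrete, here is a minimal pure-Python sketch. It is not TensorFlow's actual implementation; the function names softmax, cross_entropy_from_probs and cross_entropy_from_logits are hypothetical and used only for illustration. It compares the two-step route (softmax first, then log) with a fused computation that works directly on the logits using the log-sum-exp trick, which is roughly the kind of thing a loss function can do when it is told from_logits=True.

```python
import math

def softmax(logits):
    # Subtract the max so exp() never overflows.
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def cross_entropy_from_probs(probs, label):
    # Two-step route: the loss only sees probabilities.
    # If the true class probability underflowed to 0.0, the loss is infinite.
    p = probs[label]
    return -math.log(p) if p > 0.0 else math.inf

def cross_entropy_from_logits(logits, label):
    # Fused route: -log(softmax(z)[label]) = logsumexp(z) - z[label].
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]

# Ordinary logits: both routes agree.
ordinary = [2.0, 1.0, 0.1]
print(cross_entropy_from_probs(softmax(ordinary), 0))
print(cross_entropy_from_logits(ordinary, 0))

# Saturated logits: the two-step route loses the information, the fused one does not.
saturated = [1000.0, 0.0]
print(cross_entropy_from_probs(softmax(saturated), 1))   # infinite
print(cross_entropy_from_logits(saturated, 1))           # finite, about 1000
```

In the saturated case the softmax output for the true class underflows to exactly 0.0, so taking the log afterwards cannot recover a finite loss, while the fused version still returns a usable value.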
