Improved implementation of softmax regression

I am really struggling to understand why we set the output layer to linear when we pass the flag from_logits=True in the improved implementation of softmax regression, and then apply softmax to the model's output afterwards with this code:

logits = model(X)
f_x = tf.nn.softmax(logits)

How is this conceptually the same thing as the previous implementation?

When you use a linear output layer together with a loss function that has from_logits=True, TensorFlow fuses the softmax and the cross-entropy into a single internal computation. That fused version is both more efficient and more numerically stable than computing the softmax in the output layer and then taking its log in the loss.
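To see why the fused version matters, here is a minimal NumPy sketch (not TensorFlow's actual code) comparing a naive cross-entropy, which applies softmax first and then takes the log, against a fused log-sum-exp formulation. The function names `naive_ce` and `fused_ce` are just illustrative:

```python
import numpy as np

def naive_ce(z, y):
    # softmax first, then log: exp() overflows for large logits
    p = np.exp(z) / np.sum(np.exp(z))
    return -np.log(p[y])

def fused_ce(z, y):
    # log-softmax via the log-sum-exp trick: subtract max(z) first,
    # so no exp() ever sees a large argument
    m = np.max(z)
    return (m + np.log(np.sum(np.exp(z - m)))) - z[y]

z = np.array([1000.0, 0.0, -1000.0])  # extreme logits
print(naive_ce(z, 0))   # nan: exp(1000) overflows to inf
print(fused_ce(z, 0))   # 0.0: stable, and the mathematically correct loss
```

For moderate logits the two give identical answers; the fused form only differs in that it never breaks down at the extremes. This is the kind of rearrangement TensorFlow can do internally once it sees raw logits instead of already-softmaxed probabilities.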


Thank you for your kind response, but I am still a bit unclear about this. How does it know it’s supposed to use softmax when we have made no specification and we are using a linear output layer? Without softmax, wouldn’t the model be outputting something else entirely?

The softmax is applied inside the loss function. When you pass from_logits=True to the categorical cross-entropy loss, the loss itself converts the raw linear outputs (the logits) into probabilities before computing the cross-entropy. The model's forward pass stays linear, which is exactly why you call tf.nn.softmax yourself afterwards when you want the probabilities rather than the logits.
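Conceptually, the flag just moves the softmax from the model into the loss. A toy NumPy sketch of a sparse categorical cross-entropy with such a flag (again illustrative, not TensorFlow's real implementation) makes the equivalence explicit:

```python
import numpy as np

def softmax(z):
    e = np.exp(z - np.max(z))  # shift by max for stability
    return e / e.sum()

def sparse_categorical_crossentropy(out, y, from_logits=False):
    # from_logits=True: the loss applies softmax itself,
    # so the model's output layer can stay linear
    p = softmax(out) if from_logits else out
    return -np.log(p[y])

z = np.array([2.0, -1.0, 0.5])  # raw output of a linear layer

# option A: linear model output, softmax inside the loss
loss_a = sparse_categorical_crossentropy(z, 0, from_logits=True)

# option B: softmax in the model, loss takes probabilities directly
loss_b = sparse_categorical_crossentropy(softmax(z), 0)
```

Both options produce the same loss value, so training is conceptually unchanged; option A simply lets the framework choose the stabler fused computation.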
