I have one more question:
Q2: If we don't use softmax in the last layer (for more stable computation), then how does our model compare its output against the labels each epoch and train the weights?
Tarun, refer to the link below to understand why from_logits=True is significant in the loss with respect to the softmax activation when it is not used in the last dense layer of the model architecture.
So what you guys are saying is: if the final layer has a linear activation and we set
from_logits = True,
then the loss function will expect raw logits in (-infinity, +infinity) and internally apply softmax to them when computing the loss against y_train, whose integer labels range over [0, N-1].
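Here is a minimal sketch of that equivalence, assuming TensorFlow/Keras (the logit values and the 3-class example are made up for illustration):

```python
import tensorflow as tf

# Raw logits from a linear final layer: any real values in (-inf, +inf).
logits = tf.constant([[2.0, 1.0, 0.1]])
# Integer label for SparseCategoricalCrossentropy: a value in [0, N-1].
y_true = tf.constant([0])

# Option 1: the loss receives raw logits and applies softmax internally.
loss_from_logits = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True)
print(loss_from_logits(y_true, logits).numpy())

# Option 2: softmax applied explicitly, the loss receives probabilities.
probs = tf.nn.softmax(logits)
loss_from_probs = tf.keras.losses.SparseCategoricalCrossentropy(from_logits=False)
print(loss_from_probs(y_true, probs).numpy())

# Both print the same cross-entropy value (~0.417); Option 1 just lets
# the loss do the softmax in a numerically more stable way.
```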
(Deepti, thanks for the article!)
No, the understanding should be: because we are not using a softmax activation in the last dense layer, the loss must be given from_logits=True so that it receives the raw logits and applies the softmax internally. The loss chosen here, SparseCategoricalCrossentropy, is designed to pair with softmax; it is not the right choice when the last dense layer's output is treated as plain linear scores (i.e. without from_logits=True) or passed through a sigmoid activation.
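For completeness, a hedged sketch of what such a model definition and compile step could look like (the layer sizes, input shape, and 3-class setup are illustrative assumptions, not from this thread):

```python
import tensorflow as tf

# Hypothetical 3-class classifier; the last Dense layer has no
# activation argument, so it defaults to linear and outputs raw logits.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(25, activation="relu", input_shape=(400,)),
    tf.keras.layers.Dense(15, activation="relu"),
    tf.keras.layers.Dense(3),  # linear output: raw logits
])

# from_logits=True tells the loss to apply softmax internally, so the
# softmax + SparseCategoricalCrossentropy pairing is preserved even
# though the layer itself is linear.
model.compile(
    optimizer="adam",
    loss=tf.keras.losses.SparseCategoricalCrossentropy(from_logits=True),
)

# At predict time the outputs are logits, so apply softmax yourself
# if you need probabilities:
#   probs = tf.nn.softmax(model(X))
```

This answers Q2 as well: training is unaffected because the loss still computes the same cross-entropy against y_train each epoch; the softmax has simply moved from the layer into the loss.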