Please help with compute total loss!

Mohammad_Foroughi · February 27, 2023, 5:25am

Hello,
I used tf.keras.losses.categorical_crossentropy and also tf.reduce_sum.
The logits and labels have shape (6, num_examples), so I used tf.transpose on them to get the shape that categorical_crossentropy expects.
At the end I divided the result by the number of examples.
I still get a big difference from the expected value.
Please help.
Thanks

paulinpaloalto · February 27, 2023, 5:36am

In this function we are computing the sum, not the average of the costs. The other thing to check is to make sure you used the from_logits parameter correctly. We are passing the “logits” here and not the softmax output values, right? Here’s a thread which talks about that and why it is done that way.
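To make the from_logits point concrete, here is a minimal sketch on toy data, not the assignment's reference code. The function name total_loss_sketch and the 3-class, 2-example tensors are made up for illustration; the only real API calls are tf.transpose, tf.keras.losses.categorical_crossentropy and tf.reduce_sum.

```python
import tensorflow as tf


def total_loss_sketch(logits, labels):
    """Sum of per-example cross-entropy losses (hypothetical helper).

    Both inputs are assumed to have shape (num_classes, num_examples).
    """
    # categorical_crossentropy expects (num_examples, num_classes),
    # so both tensors are transposed first.
    per_example = tf.keras.losses.categorical_crossentropy(
        tf.transpose(labels),
        tf.transpose(logits),
        from_logits=True,  # inputs are raw logits, not softmax outputs
    )
    # Sum over examples; no division by the number of examples.
    return tf.reduce_sum(per_example)


# Toy data: 3 classes, 2 examples (values are arbitrary).
logits = tf.constant([[2.0, 0.5], [1.0, 0.5], [0.1, 3.0]])
labels = tf.constant([[1.0, 0.0], [0.0, 0.0], [0.0, 1.0]])

total = total_loss_sketch(logits, labels)
print(float(total))
```

With from_logits=True the softmax is applied inside the loss function, so the values passed in should be the raw outputs of the last linear layer.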

Here’s a thread which talks about why it’s the sum, not the average.
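As a rough illustration of why a sum can be more convenient than an average, here is a small sketch in plain Python. The loss values and minibatch sizes are invented, and the motivation shown (accumulating per-minibatch sums and dividing once by the total number of examples) is a common one that I am assuming applies here.

```python
# Hypothetical per-example losses, split into minibatches of unequal size.
minibatches = [[0.2, 0.4, 0.6], [0.8]]

total_examples = sum(len(batch) for batch in minibatches)

# Accumulate per-minibatch sums, then divide once by the total count.
# This gives the exact mean over all examples.
mean_from_sums = sum(sum(batch) for batch in minibatches) / total_examples

# Averaging the per-minibatch means instead gives the smaller
# final minibatch too much weight.
mean_of_means = sum(sum(b) / len(b) for b in minibatches) / len(minibatches)

print(mean_from_sums, mean_of_means)
```

The two results differ whenever the minibatches are not all the same size, which is why returning the sum and dividing later is the safer choice.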

Mohammad_Foroughi · February 27, 2023, 1:33pm

Thank you. After using the from_logits parameter and removing the division from my code, it went smoothly.
The reason for my confusion was this part of the text in the exercise:

paulinpaloalto · February 27, 2023, 3:38pm

The second link I gave you in my previous reply explains exactly that point. Please have another look at it.
