Hey @muhammadahmad, the labels and logits you are passing are not in the shape the loss function expects. You need to transpose them before passing them in. Reading the documentation always helps.
Thanks, it worked. The expected output mentioned in the assignment is not correct, and after evaluation it gave me 80/100 even though I passed all the tests, so kindly check those bugs.
Also, I have removed your mark of my post as “solution”, because I gave you the answer directly. We encourage learners to read the documentation; we want them to figure this out on their own.
I just finished going through the new version of this assignment as well. There are a number of incorrect “expected values”. I can file a git issue on this if you want.
There is a new version of the notebook which has the correct “expected value” for the compute cost test case. It should show as:
Expected output
tf.Tensor(0.4051435, shape=(), dtype=float32)
If it doesn’t, then you don’t have the latest version. If the value is correct but your code produces a different value, then you have a bug that you need to find. Common mistakes (a short sketch follows this list) are:
- Using binary_crossentropy loss instead of categorical_crossentropy.
- Missing the instructions about the shapes that are needed by categorical cross entropy.
- Forgetting to specify the correct value of the from_logits parameter to the loss function.
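For concreteness, here is a minimal sketch that puts those three points together. It assumes (as the transpose hint above implies) that labels and logits arrive shaped (num_classes, num_examples) straight out of forward_propagation, and it uses a mean reduction; whether your notebook version wants a mean or a sum at the end is something to check against your own instructions.

```python
import tensorflow as tf

def compute_cost(logits, labels):
    # Both inputs are assumed to arrive shaped (num_classes, num_examples),
    # so they are transposed to (num_examples, num_classes), which is the
    # layout categorical_crossentropy expects (classes on the last axis).
    # from_logits=True tells the loss to apply softmax internally, because
    # forward_propagation stops at Z3 with no output activation.
    per_example_loss = tf.keras.losses.categorical_crossentropy(
        tf.transpose(labels), tf.transpose(logits), from_logits=True)
    # Collapse the per-example losses into a single scalar cost.
    cost = tf.reduce_mean(per_example_loss)
    return cost
```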
Thank you for the comment; it is a very useful and apt summary of the recent flurry of forum threads about this issue.
What does the third bullet point in that comment refer to? I do have the 0.405 expected value, have already reshaped the values (tried both tf.transpose and tf.reshape), and checked that I am using categorical cross entropy. Yet I am still not getting the right value.
I don’t see a “from_logits” parameter anywhere in the notebook. What are you referring to?
I guess they must have removed some text from the notebook. The point is that all the loss functions support the idea that you leave out the activation function on the output layer and then let the loss function do both the activation (sigmoid or softmax, depending on whether it’s a binary or multiclass classification) and the “log loss” calculation as a bundled operation. It turns out that doing it that way is both more efficient (one less TF call) and more numerically stable.
Notice that we are told not to include the output activation on the last layer in forward_propagation. So we need to specify from_logits = True to tell the loss function we are giving it inputs that are not the activation outputs. That is an optional parameter to the loss functions and the default value is False.
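If it helps to see what that bundling means in practice, here is a small standalone check; the logits and labels below are made up purely for illustration.

```python
import tensorflow as tf

# Made-up raw logits and one-hot labels: two examples, three classes.
logits = tf.constant([[2.0, 1.0, 0.1],
                      [0.3, 2.5, 0.2]])
labels = tf.constant([[1.0, 0.0, 0.0],
                      [0.0, 1.0, 0.0]])

# Option 1: apply softmax yourself, then use the default from_logits=False.
loss_two_steps = tf.keras.losses.categorical_crossentropy(
    labels, tf.nn.softmax(logits))

# Option 2: pass the raw logits and let the loss apply softmax internally.
loss_bundled = tf.keras.losses.categorical_crossentropy(
    labels, logits, from_logits=True)

print(loss_two_steps.numpy())
print(loss_bundled.numpy())
```

Both calls give the same per-example losses (up to floating point), but the from_logits=True form is one less op and never takes the log of an already-computed softmax, which is where the extra numerical stability comes from.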
I see… just to see if I understand this correctly: in forward_propagation we only calculate Z3… because the categorical_crossentropy function includes the activation function AND loss function inside it?
It would probably also be useful to add that from_logits note in the notebook!
Yes, that’s the correct interpretation. You need to specify from_logits = True on the loss function to tell it to do the activation internally. That is optional, but it is the way Prof Ng always has us do things. As I mentioned above, it’s less code and it’s more numerically stable, so why wouldn’t you do it that way?
BTW the from_logits parameter is described on the documentation pages for categorical_crossentropy and binary_crossentropy. You have read those, right?
@andros: It’s good to hear that you found the solution under your own power. If you passed the test, I assume you also figured out that you have the order of the arguments backwards for labels and logits.
cost = tf.keras.losses.categorical_crossentropy(tf.transpose(labels), tf.transpose(logits), from_logits=True)
tf.reduce_mean(cost)
I had to transpose logits and labels before using tf.keras.losses.categorical_crossentropy, but it does not work. Can anyone help me?
The first line looks correct to me. But notice that you don’t assign the output of the reduce_mean to anything. I tried it that way and I end up with a 2 element 1D tensor, even if I also supply the argument axis = None (which the documentation says is the default). The documentation makes it sound like this should work, but my observation is that it does not. Perhaps this is because we are running in “Eager” mode …
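For what it’s worth, here is a quick standalone way to see that behaviour with made-up per-example losses: tf.reduce_mean returns a new scalar tensor rather than modifying its input, so unless you capture the result, cost stays a 2 element 1D tensor.

```python
import tensorflow as tf

# Made-up per-example losses, standing in for the output of the first line.
cost = tf.constant([0.3, 0.5])

tf.reduce_mean(cost)         # returns a new scalar tensor, discarded here
print(cost.shape)            # still (2,): cost itself is unchanged

cost = tf.reduce_mean(cost)  # capture the reduced value instead
print(cost)                  # a scalar tf.Tensor, roughly 0.4
```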
The test cell prints the value of cost. What do you see with your code?