DLS Course 2 Week 3 Compute Loss

thangngxuan · November 24, 2021, 10:49am

cost = tf.keras.losses.categorical_crossentropy(y_true=tf.transpose(labels), y_pred=tf.transpose(logits),from_logits=True)
tf.reduce_mean(cost)

I had to transpose logits and labels before using tf.keras.losses.categorical_crossentropy, but it does not work for me. Can anyone help me?

ai_curious · November 24, 2021, 11:36am

CCE.call() expects the following:

y_true: Ground truth values. shape = [batch_size, d0, … dN]
y_pred: The predicted values. shape = [batch_size, d0, … dN]

If you don’t have those shapes, I suggest investigating why not. ‘I had to transpose…’ is a red flag.

However, I believe the problem is that you are mixing the syntax for calling the constructor (making a new instance of CCE) versus computing the loss on your training data.

Check the example for standalone usage [tf.keras.losses.CategoricalCrossentropy | TensorFlow Core v2.7.0]


cce = tf.keras.losses.CategoricalCrossentropy()
loss = cce(y_true, y_pred)
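
For reference, here is a minimal sketch of how the class form and the function form relate. The toy tensors are made up for illustration and are not taken from the assignment. The function tf.keras.losses.categorical_crossentropy does accept y_true and y_pred and returns one loss value per example, while the CategoricalCrossentropy object, with its default reduction, returns the mean over the batch.

```python
import tensorflow as tf

# Hypothetical toy data, shape (batch_size, num_classes) = (2, 3).
y_true = tf.constant([[0.0, 1.0, 0.0], [0.0, 0.0, 1.0]])
y_pred = tf.constant([[0.05, 0.95, 0.0], [0.1, 0.8, 0.1]])  # probabilities, not logits

# Class form: build the loss object first, then call it on the data.
cce = tf.keras.losses.CategoricalCrossentropy()
loss_from_class = cce(y_true, y_pred)  # scalar, averaged over the batch

# Function form: returns one loss per example, so it still has to be reduced.
per_example = tf.keras.losses.categorical_crossentropy(y_true=y_true, y_pred=y_pred)
loss_from_function = tf.reduce_mean(per_example)

print(per_example.shape)  # one entry per example
print(float(loss_from_class), float(loss_from_function))
```

Under these assumptions the two scalars should agree, so either form can be used as long as the reduction is handled.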

thangngxuan · November 24, 2021, 12:52pm

Thanks for your help. I tried transposing labels and logits before calling CCE, but I still get an error.

    labels = tf.transpose(labels)
    logits = tf.transpose(logits)
    cost = tf.keras.losses.categorical_crossentropy(y_true=labels, y_pred=logits,from_logits=True)
    tf.reduce_mean(cost)

ai_curious · November 24, 2021, 12:57pm

Compare your code for the line cost = … with the two lines I pasted from the TensorFlow doc

y_true and y_pred are not valid arguments to the CCE constructor
I haven’t looked at the code for this exercise for a long time, but would bet money that transposing the training data is ‘fixing’ the wrong problem. The point isn’t to do the transpose in a different line of code, but that you probably shouldn’t need to be doing it at all.

thangngxuan · November 24, 2021, 1:24pm

Yeah, I was able to fix the problem. The logits and labels really do need to be transposed, because tf.keras.losses.categorical_crossentropy expects tensors of shape (num_samples, num_classes). The issue in my code came from tf.reduce_mean: its result was never assigned to anything. It must be cost = tf.reduce_mean(cost) or cost = tf.reduce_mean(tf.keras.losses.categorical_crossentropy(y_true = labels,y_pred = logits, from_logits=True)). BTW, the argument from_logits=True is also needed, to tell the function that the inputs are raw logits so that it applies the softmax itself.
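
Putting the fix together, a minimal sketch might look like the following. It assumes the logits and labels arrive with shape (num_classes, num_samples), as described in this thread. The toy values and the helper name compute_cost are illustrative assumptions, not the assignment's own code.

```python
import tensorflow as tf

# Hypothetical toy data laid out as (num_classes, num_samples) = (3, 2).
logits = tf.constant([[2.0, 0.5], [1.0, 1.5], [0.1, 3.0]])
labels = tf.constant([[1.0, 0.0], [0.0, 0.0], [0.0, 1.0]])

def compute_cost(logits, labels):
    """Mean categorical cross-entropy for (num_classes, num_samples) inputs."""
    # Transpose so each row is one example: (num_samples, num_classes).
    per_example = tf.keras.losses.categorical_crossentropy(
        y_true=tf.transpose(labels),
        y_pred=tf.transpose(logits),
        from_logits=True,  # inputs are raw scores; softmax is applied inside the loss
    )
    # Return the reduced value; calling tf.reduce_mean without keeping its result has no effect.
    return tf.reduce_mean(per_example)

cost = compute_cost(logits, labels)
print(float(cost))
```

The key point is that tf.reduce_mean returns a new tensor rather than modifying cost in place, so its result has to be assigned or returned.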
