Understanding loss in TensorFlow

Samhita_V · January 29, 2024, 1:05pm

I am trying to calculate the L2 loss between the feature maps of the deepest classifier and those of each shallow classifier in a CNN. I am having trouble understanding how this loss should be fed back to the shallow feature maps.

carlosrl · January 29, 2024, 1:57pm

Hi @Samhita_V
This does not seem to be related to a course exercise, so I am assuming it is a general question.
Speaking generically, you can calculate the L2 loss between feature maps and propagate this loss back to the shallow feature maps. Here is a code snippet that illustrates what I mean:

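import tensorflow as tf

model = ...    # the Keras model being trained
inputs = ...   # a batch of inputs
target = ...   # the tensor the model output should match
# create a criterion for the L2 loss
criterion = tf.keras.losses.MeanSquaredError()
# get the model optimizer
optimizer = ...
# record the forward pass so the tape can trace the loss back to the weights
with tf.GradientTape() as tape:
    output = model(inputs, training=True)
    # compute the loss between the target and the model output
    loss = criterion(target, output)
# compute the gradients
grads = tape.gradient(loss, model.trainable_variables)
# apply the gradients to update the weights
optimizer.apply_gradients(zip(grads, model.trainable_variables))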
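
For the specific case of a deep-to-shallow feature loss, a minimal sketch might look like the following. Everything in it is an assumption made for illustration: the two-block network, the layer sizes, the dummy batch, and the `adapter` that pools and projects the shallow features so their shape matches the deep ones. The deep features are wrapped in `tf.stop_gradient` so that they act as a fixed target and this loss only updates the shallow block and the adapter.

```python
import tensorflow as tf

tf.random.set_seed(0)

# Hypothetical two-stage network: a shallow block followed by a deep block.
shallow_block = tf.keras.Sequential([
    tf.keras.layers.Conv2D(8, 3, padding="same", activation="relu"),
    tf.keras.layers.MaxPool2D(),
])
deep_block = tf.keras.Sequential([
    tf.keras.layers.Conv2D(16, 3, padding="same", activation="relu"),
    tf.keras.layers.MaxPool2D(),
])
# Hypothetical adapter that brings shallow features to the deep feature shape.
adapter = tf.keras.Sequential([
    tf.keras.layers.MaxPool2D(),   # match the spatial size
    tf.keras.layers.Conv2D(16, 1), # match the channel count
])

mse = tf.keras.losses.MeanSquaredError()
optimizer = tf.keras.optimizers.SGD(learning_rate=0.01)
images = tf.random.normal([4, 32, 32, 3])  # dummy batch of images

with tf.GradientTape() as tape:
    shallow_features = shallow_block(images, training=True)
    deep_features = deep_block(shallow_features, training=True)
    # Treat the deep features as a fixed target: no gradient flows through them.
    target = tf.stop_gradient(deep_features)
    feature_loss = mse(target, adapter(shallow_features, training=True))

# Only the shallow block and the adapter are updated by this loss.
variables = shallow_block.trainable_variables + adapter.trainable_variables
grads = tape.gradient(feature_loss, variables)
optimizer.apply_gradients(zip(grads, variables))
```

In a full model this feature loss would normally be added to the classification losses of each classifier, which are left out here to keep the sketch short.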

Keep learning!

Samhita_V · January 29, 2024, 2:17pm

Can you explain what is happening here? Can we not apply model.compile directly and use a custom loss?
grads = tf.GradientTape().gradient(loss, model.trainable_variables)
optimizer.apply_gradients(zip(grads, model.trainable_variables))
Also, since the input and target will have different shapes, will those have to be adjusted too?

carlosrl · January 29, 2024, 6:58pm

The question was about understanding how the loss works, so the code snippet computes and applies the gradients manually. Essentially, it does what model.compile() and model.fit() do together, but with more control over the individual steps, which makes the process easier to understand. You can get more details in the link below.
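
To make the comparison concrete, here is a small sketch that runs one training step both ways. The `l2_loss` function, the one-layer model, and the random data are all made up for illustration. Both models start from the same weights, so a single full-batch step through `fit()` and a single hand-written step should leave them with essentially the same weights.

```python
import tensorflow as tf

def l2_loss(y_true, y_pred):
    # Hypothetical custom loss: mean squared difference for each example.
    return tf.reduce_mean(tf.square(y_true - y_pred), axis=-1)

def make_model():
    # Tiny illustrative model; the architecture is arbitrary.
    return tf.keras.Sequential([
        tf.keras.Input(shape=(3,)),
        tf.keras.layers.Dense(2),
    ])

x = tf.random.normal([8, 3])
y = tf.random.normal([8, 2])

# Option 1: pass the custom loss to compile() and let fit() run the step.
compiled_model = make_model()
initial_weights = compiled_model.get_weights()
compiled_model.compile(
    optimizer=tf.keras.optimizers.SGD(learning_rate=0.1), loss=l2_loss
)
history = compiled_model.fit(x, y, batch_size=8, epochs=1, verbose=0)

# Option 2: the same single step written out by hand.
manual_model = make_model()
manual_model.set_weights(initial_weights)  # start from the same weights
optimizer = tf.keras.optimizers.SGD(learning_rate=0.1)
with tf.GradientTape() as tape:
    predictions = manual_model(x, training=True)
    # Average the per-example losses over the batch.
    loss = tf.reduce_mean(l2_loss(y, predictions))
grads = tape.gradient(loss, manual_model.trainable_variables)
optimizer.apply_gradients(zip(grads, manual_model.trainable_variables))
```

So a custom loss can be passed to model.compile() when it only needs the labels and the model output. The manual loop is mainly useful when the loss depends on intermediate tensors such as feature maps.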

Keep learning!

Samhita_V · January 29, 2024, 7:29pm

alright thank you sm!

Deepti_Prasad · January 29, 2024, 7:46pm

Hello @Samhita_V

Is your question more related to the TensorFlow Advanced Techniques Specialisation?

I ask because you mention GradientTape.

Samhita_V · January 30, 2024, 11:38am

yes it was!
