Loss in semantic segmentation - C4 - Week 3

leodemachado · February 17, 2022, 4:53pm

Friends,
I still do not understand the loss function used for the semantic segmentation using U-Net.

Sparse categorical cross entropy receives a volume which is (None, Width, height, num_classes). It does so because this loss function makes use of tf.argmax() to turn it into a one-layer labeled image. Is that correct?

In the example, it is used only accuracy metric.

But what if I want to check IoU, Dice, and so on?
I tried, but it complains that the output of the model has a different shape.

Any clues?

paulinpaloalto · May 24, 2022, 8:01pm

Yes, the output of the model gives a softmax value at each pixel of the image. That’s the key point about image segmentation: the classification output is per pixel. So you use categorical cross entropy as always with softmax output, but the difference is that it is per pixel, not a single output for the entire image.

Topic		Replies	Views
Course 4 week 3 Unet - Loss Function Convolutional Neural Networks coursera-platform	2	552	October 22, 2021
The choice of loss function and activation function AI for Medical Diagnosis week-module-3	2	637	March 10, 2023
U-net assignment: Confused about Y_train dimensions Convolutional Neural Networks coursera-platform	4	545	April 10, 2022
Week 3 - UNET_V4 - Training Error - "Shapes (None, 96, 128, 1) and (None, 96, 128, 23) are incompatible" Convolutional Neural Networks week-module-3 , coursera-platform	9	355	February 3, 2024
Loss Function for Semantic Segmentation Convolutional Neural Networks coursera-platform	3	361	September 20, 2023

Loss in semantic segmentation - C4 - Week 3

Related topics