Course 4 Week3: U Net Assignment Doubt: Details about Preprocessing

megha · August 11, 2021, 1:39pm

I am a little confused about the pre processing part of data in W4C3A2. I believe the dataset used uses RGBA images.

Then why do we specify the number of channels as 3? Also, what exactly are we trying to do with the masked images in the preprocess_path function(i.e. what how does tf.math.reduce_max(mask, axis=-1, keepdims=True) help/how will getting the maximum along the last axis help us?)

Lastly, why are we reshaping it to (96,128)?
TIA

megha · August 12, 2021, 7:16am

@Mubsi hey, I saw you’re one of the mentors. Could I get some help?

XpRienzo · August 12, 2021, 10:31am

We’re effectively getting rid of the alpha channel by decoding the png to just three channels since we don’t need transparency information for what we’re doing.
While U-Net is fully convolutional and can use any sized image, we’d definitely want to consider performance, the assignment uses 96x128 for that particular reason from my understanding.

I will get back to you on why we’re doing the max in a bit.

XpRienzo · August 12, 2021, 2:57pm

Now for the max used in the mask we just need one channel rather than all three channels. So what we’re doing here is taking the highest value in any of the rgb channels for any particular pixel (this is why we’re doing the max along the last axis).

Kaiju · December 26, 2021, 10:14am

Do I understand correctly, that for the mask images, we select the highest value regardless of the color R, G, or B. Why do we need to reduce the dimension from 3 channels to just 1? Does this create a risk of overlap? e.g. two classes having the same highest “R” value but different G and B values?

sjking · June 27, 2023, 5:03pm

I had the same question. Unless I am missing something, there appears to be an implicit assumption about the RGB color values of the mask. For example a Purple mask and an Orange mask might both have the same red value, and if that was the max value for each of those colors then I think that it would produce the same class.

Topic		Replies	Views
DLS Course 4 - Week 3: U-Net Image Segmentation Assignment Convolutional Neural Networks	4	519	November 14, 2022
Week 3, Programming Assignment 2, section 2.2 Convolutional Neural Networks	2	350	September 26, 2023
Week 3 Image_segmentation_Unet Preprocess may have a bug Convolutional Neural Networks	2	571	August 1, 2021
U-net exercise: a doubt about image shapes Convolutional Neural Networks	1	519	May 4, 2022
Week 3, programming assignment 2: how does this code snippet work? Convolutional Neural Networks	2	485	September 5, 2022

Course 4 Week3: U Net Assignment Doubt: Details about Preprocessing

Related topics