Why normalize pixel locations

Tapendra_kumar_Vashi · August 6, 2022, 7:01am

Hi everyone,

In the week 3 lab on MNIST digit localization, why are the pixel locations such as xmax and xmin divided by 75 when the images are transformed?

import tensorflow as tf

def read_image_tfds(image, label):
    '''
    Transforms each image in dataset by pasting it on a 75x75 canvas at random locations.
    '''
    # Random top-left corner; maxval 48 is exclusive, so the 28x28 digit always fits on the canvas.
    xmin = tf.random.uniform((), 0, 48, dtype=tf.int32)
    ymin = tf.random.uniform((), 0, 48, dtype=tf.int32)
    image = tf.reshape(image, (28, 28, 1))
    # Arguments are (image, offset_height, offset_width, target_height, target_width).
    image = tf.image.pad_to_bounding_box(image, ymin, xmin, 75, 75)
    image = tf.cast(image, tf.float32) / 255.0  # scale pixel values to [0, 1]
    xmin = tf.cast(xmin, tf.float32)
    ymin = tf.cast(ymin, tf.float32)

    # Box corners divided by the canvas size (75).
    xmax = (xmin + 28) / 75
    ymax = (ymin + 28) / 75
    xmin = xmin / 75
    ymin = ymin / 75
    return image, (tf.one_hot(label, 10), [xmin, ymin, xmax, ymax])

gent.spah · August 7, 2022, 10:04am

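This is because you are placing those images on a bigger canvas of 75 by 75 pixels. Dividing by 75 expresses each bounding box coordinate as a fraction of the canvas width or height, so the values fall between 0 and 1.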
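
To make the arithmetic concrete, here is a minimal sketch in plain Python (no TensorFlow needed). The helper names normalize_box and denormalize_box are hypothetical and not part of the lab; the 75 and 28 constants come from the code in the question. It shows pixel corners being turned into fractions of the canvas size and then mapped back.

```python
CANVAS_SIZE = 75  # canvas side in pixels, as in the lab code
DIGIT_SIZE = 28   # MNIST digit side in pixels


def normalize_box(xmin_px, ymin_px, canvas=CANVAS_SIZE, digit=DIGIT_SIZE):
    """Return [xmin, ymin, xmax, ymax] as fractions of the canvas size."""
    xmax_px = xmin_px + digit
    ymax_px = ymin_px + digit
    return [xmin_px / canvas, ymin_px / canvas,
            xmax_px / canvas, ymax_px / canvas]


def denormalize_box(box, canvas=CANVAS_SIZE):
    """Map fractional coordinates back to whole pixel positions."""
    return [round(v * canvas) for v in box]


# Largest xmin the lab code can draw is 47 (the upper bound 48 is exclusive).
box = normalize_box(47, 0)
print(box)                   # every value lies between 0 and 1
print(denormalize_box(box))  # [47, 0, 75, 28]
```

Multiplying by 75 recovers the pixel coordinates, so no information is lost by storing the box as fractions.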

Tapendra_kumar_Vashi · August 9, 2022, 3:04pm

Thank you for the reply.
Could you please look into this too?
https://community.deeplearning.ai/t/why-nomalize-images/181632/2
