Hi everyone,
In the week 3 assignment, why do we divide the image array by 127.5 and also subtract one?
def read_image_tfds(image, bbox):
    image = tf.cast(image, tf.float32)
    shape = tf.shape(image)
    factor_x = tf.cast(shape[1], tf.float32)   # original width
    factor_y = tf.cast(shape[0], tf.float32)   # original height
    image = tf.image.resize(image, (224, 224))
    image = image / 127.5
    image -= 1                                 # pixels now in [-1, 1]
    bbox_list = [bbox[0] / factor_x,
                 bbox[1] / factor_y,
                 bbox[2] / factor_x,
                 bbox[3] / factor_y]           # bbox scaled relative to the original image size
    return image, bbox_list
In this case the pixel values will be between -1 and 1. The main purpose of doing such normalization is so that the neural network can converge faster: the weights do not oscillate as much in magnitude, and less computation is needed compared to when you are dealing with bigger numbers.
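Just to illustrate the arithmetic (a tiny sketch of my own, not part of the assignment code): dividing by 127.5 maps 0..255 onto 0..2, and subtracting 1 shifts that to [-1, 1]:

import numpy as np

# Endpoints and midpoint of the 0..255 pixel range, with the same scaling as above
pixels = np.array([0.0, 127.5, 255.0])
print(pixels / 127.5 - 1)   # -> [-1.  0.  1.]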
Thank you for the answers. I have one more question.
Is there any reason for choosing the pixel range [-1, 1] over the range [0, 1], or is this choice arbitrary?
I don’t think there is much difference between the ranges [-1, 1] and [0, 1] in terms of computation and convergence; they are essentially the same, just shifted and rescaled. I suspect that with [-1, 1] there is better separation of pixel values, and this kind of normalization might also be used for some other purpose downstream, but these are just suspicions; I would have to study it thoroughly to give a more precise answer…
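For what it's worth, here is a small sketch (my own illustration, not from the course material) showing that the two ranges are related by a simple affine map, which a first layer with a bias can in principle absorb:

import numpy as np

x01 = np.linspace(0.0, 1.0, 5)    # values already scaled to [0, 1]
xpm1 = 2.0 * x01 - 1.0            # the same values rescaled to [-1, 1]
print(x01)    # [0.   0.25 0.5  0.75 1.  ]
print(xpm1)   # [-1.  -0.5  0.   0.5  1. ]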
I agree and feel that [0,1] or [-1,1] should not matter much.
One thought here is that with [0,1], the darkest pixels would definitively get scaled to “0”, which could falsely trick the network into thinking that these pixels are not important. An easy way to look at this is through the activation function g(w*x + b): if a pixel x = 0, it contributes nothing to w*x + b, and the gradient of the loss with respect to that pixel's weight is zero, which implies no learning from that pixel (there is a small sketch below illustrating this).
Of course, you could still have “0” pixels with the [-1,1] normalization, but since this range is stretched around zero, the probability of that happening is much lower.
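Here is a minimal sketch of that point (a toy example I made up, not from the assignment): with a zero input, the gradient of the loss with respect to the weight attached to that input is zero, so that weight gets no update from the example:

import tensorflow as tf

x = tf.constant([[0.0, 0.5]])            # first "pixel" is exactly 0
w = tf.Variable([[1.0], [1.0]])          # one weight per input
b = tf.Variable([0.0])

with tf.GradientTape() as tape:
    y = tf.nn.relu(tf.matmul(x, w) + b)  # g(w*x + b) as above
    loss = tf.reduce_sum(y)

dw, db = tape.gradient(loss, [w, b])
print(dw.numpy())  # [[0.], [0.5]] -- no gradient reaches the weight of the zero input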
Let me know if this makes sense.