The EfficientNetB4 model begins with a Rescaling layer followed by a Normalization layer. From the official documentation of keras.layers.Normalization I inferred that this layer shifts and scales inputs into a distribution centered around 0 with standard deviation 1. It accomplishes this by “precomputing” the mean and variance of the data, and calling (input - mean) / sqrt(var) at runtime.
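For concreteness, here is the arithmetic as I understand it, in a toy NumPy sketch (the data values are made up):

```python
import numpy as np

# Toy batch standing in for image data (values made up).
x = np.array([[10.0, 200.0], [30.0, 220.0], [50.0, 240.0]], dtype="float32")

mean = x.mean(axis=0)
var = x.var(axis=0)

normalized = (x - mean) / np.sqrt(var)  # the (input - mean) / sqrt(var) step
print(normalized.mean(axis=0), normalized.std(axis=0))  # ~0 and ~1 per feature
```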
I wonder how it “precomputes” the mean and std of the input dataset. If I use keras.preprocessing.image.ImageDataGenerator, then how and when does the precomputation of mean and std happen?
ImageDataGenerator is an iterator over the underlying images. Pixel values are still floating-point numbers, so computations for calculating the mean and variance can continue to happen at the batch level during the training process.
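For example, a minimal sketch showing that the generator just yields batches of floats (the arrays and shapes here are stand-ins, not real data):

```python
import numpy as np
from keras.preprocessing.image import ImageDataGenerator

# Stand-in data: 640 random "images" and labels (shapes are hypothetical).
images = np.random.randint(0, 256, size=(640, 64, 64, 3)).astype("float32")
labels = np.random.randint(0, 10, size=(640,))

gen = ImageDataGenerator()
batches = gen.flow(images, labels, batch_size=64)

x, y = next(batches)     # yields one batch at a time
print(x.shape, x.dtype)  # (64, 64, 64, 3) float32
```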
Okay. Suppose the training set consists of, let’s say, 6400 images and we use a batch size of 64. While training, 64 images are read for each batch and their mean and standard deviation are computed. The images of that batch are then normalized using this mean and std, and training proceeds. Am I right?
If so, wouldn’t the mean and std be different for each batch? Also, what values of mean and std will we use to normalize the images in the test set?
Please read this, where it’s mentioned that you either call adapt or manually supply the mean and variance before invoking the fit method when using keras.layers.Normalization.
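A minimal sketch of both options, assuming tf.keras and stand-in training data (the shapes are hypothetical):

```python
import numpy as np
import tensorflow as tf

# Stand-in training images (shape and values hypothetical).
train_images = np.random.rand(640, 64, 64, 3).astype("float32")

# Option 1: let adapt() precompute the statistics in one pass, before fit.
norm = tf.keras.layers.Normalization(axis=-1)
norm.adapt(train_images)

# Option 2: supply the precomputed statistics yourself, per channel.
mean = train_images.mean(axis=(0, 1, 2))
var = train_images.var(axis=(0, 1, 2))
norm_manual = tf.keras.layers.Normalization(axis=-1, mean=mean, variance=var)
```

Either way, the statistics are fixed before training starts, so they do not drift from batch to batch.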
There’s one more layer called BatchNormalization, which learns the mean and variance on the fly: during training it normalizes each batch using that batch’s statistics, while maintaining moving averages of the mean and variance that it uses at inference time. All you need to do is place this layer within the Keras model and let it learn during model training.
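Something like this (a sketch, with a hypothetical input shape and architecture):

```python
import tensorflow as tf

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(64, 64, 3)),
    tf.keras.layers.Conv2D(16, 3, activation="relu"),
    tf.keras.layers.BatchNormalization(),  # updates its moving mean/variance during fit
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(10, activation="softmax"),
])
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy")
```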
Mean and variance should be learnt only from the training dataset. Nothing new should be learnt from the test dataset since it is ONLY meant to evaluate model performance.
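So, for the test-set question above: call adapt only on the training split, and apply the same layer, with the training statistics baked in, to the test images. A sketch with stand-in arrays:

```python
import numpy as np
import tensorflow as tf

train_images = np.random.rand(640, 64, 64, 3).astype("float32")  # hypothetical split
test_images = np.random.rand(160, 64, 64, 3).astype("float32")   # hypothetical split

norm = tf.keras.layers.Normalization(axis=-1)
norm.adapt(train_images)             # statistics come from the training set only

normalized_test = norm(test_images)  # test images reuse the training mean/variance
```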
Thank you for the info @balaji.ambresh!