Confusion with Input normalization and batch normalization

avinash_pant · January 21, 2022, 7:14am

When applying batch normalization to the hidden layers in a mini batch, we use the mean and variances of that particular mini-batch only. But I am confused about the normalization of the input layer (X), should we normalize the input layer before creating the mini-batches by using the mean and variance of the whole training set or normalize the input layer after creating the mini- batches by using the mean and variance of that particular mini-batch just like we do for the hidden layers?
Hope you got my point.

balaji.ambresh · January 21, 2022, 7:08pm

In practice, we normalize inputs before feeding to the NN using using the mean and standard deviation of the training set. Batching is done after normalizing data.

The mean and standard deviation calculated using the training set are used for normalizing the validation and test sets.

SomeshChatterjee · January 22, 2022, 5:06am

Hi Avinash,

Like Balaji mentioned, you should always normalize input data before you feed it into a neural network (with certain rare exceptions). Whether you apply batch normalization in your neural networks is a secondary thing. Even if you do apply batch normalization, they are not equivalent, as in batch normalization you are just focusing on the data in the batch. A data normalized per batch/mini-batch is very different from data normalized as a whole.

To understand this more, you need to understand the underlying objective of the 2 operations. Normalization is focused on bringing different features to a common scale while maintaining the relationships between them while batch-normalization focusses on stabilizing the learning process. Batch normalization does not fulfil the objective achieved by normalization.

avinash_pant · January 22, 2022, 1:35pm

Thanks for the clarification, I get your point now.

Topic		Replies	Views
Batch Normalization vs Feature Input Normalization Improving Deep Neural Networks: Hyperparameter tun	3	630	May 24, 2021
Questions on batch normalization Improving Deep Neural Networks: Hyperparameter tun	3	365	September 27, 2023
Why do we run BatchNormalization after Conv2D? Convolutional Neural Networks	3	588	December 31, 2022
Batch Normalization Intuition questions Improving Deep Neural Networks: Hyperparameter tun week-3	8	47	July 19, 2024
Batch Normalization Or Batch Standardization Improving Deep Neural Networks: Hyperparameter tun	1	573	July 12, 2021

Confusion with Input normalization and batch normalization

Related topics