Different Mean/Standard Dev values for hidden units

Hi @Muhammad_Bin_Usman,

welcome to the community and thanks for your question!

Batch normalization can help accelerate training by normalizing the activations within each mini-batch, so that training proceeds more consistently. It does this by tackling the problem of internal covariate shift (a systematic change in the distribution of network activations as the weights change), which is also well outlined in this article: Internal Covariate Shift: How Batch Normalization can speed up Neural Network Training | by Jamie Dowat | Analytics Vidhya | Medium and this paper: [1502.03167] Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift

Can you elaborate a bit more on what you mean here specifically? In general, batch normalization is about keeping the layer activations consistent so that the layers fit well together, especially since the weights change during training and different mini-batches of training data are seen. This helps the gradient flow work efficiently and keeps the gradients stable (e.g. the risk of vanishing gradients is reduced), see also this thread: Vanishing/Exploding Gradients when there is a non-linear activation function - #3 by Christian_Simonis!
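
If it helps to make this concrete, here is a minimal NumPy sketch (my own illustration, not code from the course) of the batch norm forward step: each hidden unit is normalized to zero mean and unit variance across the mini-batch, and the learnable parameters gamma and beta then let every hidden unit settle on its own mean and standard deviation during training.

```python
import numpy as np

def batch_norm_forward(Z, gamma, beta, eps=1e-5):
    # Z: activations for one layer, shape (batch_size, n_hidden_units)
    # Per hidden unit, compute mean and variance across the current mini-batch
    mu = Z.mean(axis=0)      # shape (n_hidden_units,)
    var = Z.var(axis=0)      # shape (n_hidden_units,)

    # Normalize each hidden unit to zero mean / unit variance within the batch
    Z_norm = (Z - mu) / np.sqrt(var + eps)

    # Learnable scale (gamma) and shift (beta): after training, each hidden
    # unit can have its own mean (beta) and standard deviation (gamma)
    return gamma * Z_norm + beta

# Tiny usage example: mini-batch of 4 examples, 3 hidden units
Z = np.random.randn(4, 3) * 5 + 2   # raw activations with arbitrary scale/offset
gamma = np.ones(3)                   # typically initialized to 1
beta = np.zeros(3)                   # typically initialized to 0
Z_tilde = batch_norm_forward(Z, gamma, beta)
print(Z_tilde.mean(axis=0), Z_tilde.std(axis=0))  # approx. 0 and 1 per unit
```

So the point is not that every hidden unit must end up with mean 0 and standard deviation 1; gamma and beta are trained like any other parameters, which is why different hidden units can end up with different mean/standard deviation values.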

Hope that helps.

Best regards
Christian