Batch normalization clarification

Noam_Mizrachi · October 8, 2021, 10:28am

Hi,

I managed to understand the batch normalization lecture, but I have some thoughts on the practical side.
Does BN help in most cases? Is there a correlation with the activation being used (ReLU, sigmoid, tanh, …)?
My intuition is that BN helps when the activations are sigmoid or tanh, whereas it doesn't help ReLU, because sigmoid and tanh have their maximum derivative at 0 and an approximately zero derivative as the input goes to -inf or +inf.
Is that correct?
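
To make the derivative part of that intuition concrete, here is a small NumPy check. It is only a sketch: the helper names (`sigmoid_grad`, `tanh_grad`, `relu_grad`) and the sample points are my own choices for illustration, and it says nothing by itself about whether BN helps ReLU.

```python
import numpy as np

def sigmoid_grad(z):
    # d/dz sigmoid(z) = s * (1 - s), largest at z = 0
    s = 1.0 / (1.0 + np.exp(-z))
    return s * (1.0 - s)

def tanh_grad(z):
    # d/dz tanh(z) = 1 - tanh(z)^2, largest at z = 0
    return 1.0 - np.tanh(z) ** 2

def relu_grad(z):
    # ReLU gradient is 0 for z <= 0 and 1 for z > 0 (using 0 at z = 0 by convention)
    return (z > 0).astype(float)

# Sample points: far left, centre, far right (illustrative values)
z = np.array([-10.0, 0.0, 10.0])
sig_g = sigmoid_grad(z)
tanh_g = tanh_grad(z)
relu_g = relu_grad(z)

print("sigmoid':", sig_g)
print("tanh':   ", tanh_g)
print("relu':   ", relu_g)
```

The sigmoid and tanh gradients peak at 0 and nearly vanish at the two extremes, while the ReLU gradient stays at 1 for any positive input and is 0 for negative input.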

balaji.ambresh · April 22, 2022, 8:55pm

You can feed the BatchNorm output into a ReLU layer.
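
A minimal NumPy sketch of that ordering, assuming training-mode batch statistics only (no running averages). The function name `batch_norm`, the `eps` value, the input shape, and the random data are all assumptions for illustration, not any framework's actual implementation.

```python
import numpy as np

def batch_norm(x, gamma, beta, eps=1e-5):
    # Normalize each feature over the batch dimension (axis 0),
    # then apply the learnable scale (gamma) and shift (beta).
    mu = x.mean(axis=0)
    var = x.var(axis=0)
    x_hat = (x - mu) / np.sqrt(var + eps)
    return gamma * x_hat + beta

rng = np.random.default_rng(0)
# Hypothetical pre-activations: batch of 64 examples, 4 units,
# deliberately off-centre and with a large spread.
z = rng.normal(loc=5.0, scale=3.0, size=(64, 4))

gamma = np.ones(4)   # scale initialised to 1
beta = np.zeros(4)   # shift initialised to 0

z_bn = batch_norm(z, gamma, beta)   # roughly zero mean, unit variance per unit
a = np.maximum(z_bn, 0.0)           # ReLU applied to the BatchNorm output

print("mean per unit after BN:", z_bn.mean(axis=0))
print("std per unit after BN: ", z_bn.std(axis=0))
```

With `gamma = 1` and `beta = 0`, each unit's pre-activation is centred around 0 before the ReLU, so a mix of positive and negative values reaches it regardless of how the raw pre-activations were shifted.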
