Regarding batch normalization

Nithin · May 7, 2021, 6:05pm

In week 1 assignment 2 in sequential layer we used BatchNormalization with argument axis=3, what exactly this axis=3 mean and why are we using batch normalization ?
Also after executing happy_model.summary() I found 64 non trainable parameters what are these 64 non trainable parameters?
Thanks!

reinoudbosch · May 8, 2021, 2:46am

Hi Nithin,

With regard to your first question have a look here: [Week 2] what is the meaning of (axis = 3) in the BatchNormalization? - #6 by reinoudbosch

The 64 non trainable parameters are the mean and variance vectors of the batchnorm layer. With 32 features (axis=3 of the conv2d layer output) you get 32*2 non trainable parameters. You can also have a look here: Keras - number of parameters in BatchNorm Layer

Nithin · May 8, 2021, 12:15pm

Thank for answering,
My second doubt got clarified but…
Can u tell me the difference between axis=2 and axis=3
Thanks!

reinoudbosch · May 8, 2021, 3:59pm

Hi Nithin,

Axis=0 refers to the training examples. Axis=1 and 2 describe 2D arrays with activation values that support the extraction of particular features. Axis=3 is the axis along which the 2D arrays (one array per filter) are stacked. In order to normalize values per feature, you want therefore to normalize along axis 3.

Nithin · May 8, 2021, 4:53pm

My doubt is clarified
thank you!

James_Nathan_O · August 16, 2021, 1:18pm

wait why is it that in the sequential api we use batch normalization but we don’t in the functional api ?

TMosh · August 16, 2021, 5:15pm

It likely has to do with the problem that is being solved, rather than the method being used. Perhaps batch normalization wasn’t required in the exercise that used the functional API.

Topic		Replies	Views
C4 W1 A2: Why is axis given as 3 for BatchNormalization Convolutional Neural Networks coursera-platform	3	556	September 25, 2021
Course 4 Week 2 Assignment 1 BatchNormalization Question Convolutional Neural Networks coursera-platform	2	490	April 21, 2023
C4,W1,A2 E1 BatchNormalization for axis 3 Convolutional Neural Networks coursera-platform	5	543	November 29, 2022
Course 4, Week 1, Assignment 2: Why is batch normalization applied (only) to axis 3? Convolutional Neural Networks coursera-platform	2	524	May 10, 2022
C4W2 Resnets - Why batchnormalization axis = 3 Convolutional Neural Networks coursera-platform	3	616	August 20, 2021

Regarding batch normalization

Related topics