Course 4, Week 1, Assignment 2: Why is batch normalization applied (only) to axis 3?

nvarma · May 8, 2022, 11:18pm

In Assignment 2 of Week 1 using tensorflow sequential model (exercise 1), we are asked to use batch normalization to axis 3. Why is that?

Reading the documentation, axis should be used for features. In this case, wouldn’t the features be along two axes (1 and 2)?

And specifically, since inputs are 2 dimensional (for each channel), won’t there be two axes along which we would have to normalize it?

I am sure I am misunderstanding something here. Any help is appreciated.

balaji.ambresh · May 9, 2022, 6:30am

Have you seen this page? Read the description of axis keeping in mind that in this assignment, channels are the last dimension.

nvarma · May 10, 2022, 4:25am

Thank you Balaji.

Yes, I had seen that but working through the assignments it seems like the channel axis along which the features are stacked needs to be specified.

This is what confused me:
" axis Integer, the axis that should be normalized (typically the features axis). For instance, after a Conv2D layer with data_format="channels_first" , set axis=1 in BatchNormalization ."

Topic		Replies	Views
TF batch norm for CNNs question Convolutional Neural Networks coursera-platform	5	477	May 24, 2023
Course 4 Week 2 Assignment 1: Axis =3 Convolutional Neural Networks coursera-platform	1	493	March 18, 2023
C4 W1 A2: Why is axis given as 3 for BatchNormalization Convolutional Neural Networks coursera-platform	3	556	September 25, 2021
Week 1 Assignment 2 - BatchNormalization Axis=3 Convolutional Neural Networks week-1 , coursera-platform	1	115	April 29, 2024
Batch Normalization with axis = -1 (3) Convolutional Neural Networks coursera-platform	1	547	October 27, 2021

Course 4, Week 1, Assignment 2: Why is batch normalization applied (only) to axis 3?

Related topics