Batch Normalization with axis = -1

thanhlamtrinh · October 21, 2021, 2:04pm

Hi Coursera community,

Could you please help me understand how batch normalization is performed over the last axis of the input (for example color channels or filters)? As we learned in the lectures on Batch Normalization, for an input of shape (# examples, # features) the statistics are computed along the first axis, separately for each feature. For a convolutional network, it is still unclear to me why we set axis = -1.

I hope you can help shed some light on this issue.

jonaslalin · October 27, 2021, 10:57am

I have answered a similar post previously:
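
As a rough illustration of the idea (a minimal NumPy sketch, not the Keras implementation): the `axis` argument names the axis that is kept, and the mean and variance are computed over all the other axes. With axis = -1, a dense input of shape (# examples, # features) gets one mean/variance per feature, and a conv input of shape (# examples, height, width, # channels) gets one mean/variance per channel, pooled over examples and spatial positions. The helper name `batch_norm`, the shapes, and the `eps` value are assumptions for this example, and the learnable scale/shift (gamma, beta) and the moving averages are left out.

```python
import numpy as np

def batch_norm(x, axis=-1, eps=1e-3):
    """Normalize x with one mean/variance per index along `axis`.

    Hypothetical helper for illustration: statistics are reduced over
    every axis except `axis`; gamma/beta and moving averages are omitted.
    """
    axis = axis % x.ndim  # turn -1 into the actual last-axis index
    reduce_axes = tuple(i for i in range(x.ndim) if i != axis)
    mean = x.mean(axis=reduce_axes, keepdims=True)
    var = x.var(axis=reduce_axes, keepdims=True)
    return (x - mean) / np.sqrt(var + eps)

rng = np.random.default_rng(0)

# Dense case: (examples, features) -> statistics over axis 0, per feature.
dense = rng.normal(5.0, 2.0, size=(64, 10))
dense_out = batch_norm(dense, axis=-1)

# Conv case: (examples, height, width, channels) -> statistics over
# axes (0, 1, 2), i.e. one mean/variance per channel.
conv = rng.normal(5.0, 2.0, size=(64, 8, 8, 3))
conv_out = batch_norm(conv, axis=-1)

print(dense_out.mean(axis=0).shape)        # one mean per feature
print(conv_out.mean(axis=(0, 1, 2)).shape)  # one mean per channel
```

So the dense case from the lectures is just the special case where "all other axes" is only the batch axis. In a conv layer each channel is a single feature map produced by the same filter at every position, which is why its statistics are shared across height and width.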