In C2W3’s video on implementing batch normalization in neural networks, Prof. Andrew Ng mentioned that the bias term is unnecessary since it gets cancelled out during mean subtraction, and its role is instead taken over by the learned parameter β (beta). But doesn’t the bias term contribute to the mean of Z^[l]? Wouldn’t removing it cause us to underestimate or overestimate the true mean of Z^[l]?
Perhaps I still can’t grasp Prof. Ng’s explanation of why the bias term is rendered unnecessary in that video. Would anyone be able to elaborate in more detail on why this is the case?
Any help would be appreciated, thanks in advance.
Let’s say a neuron computes z = 3x + 7, and that this batch contains 10 samples: [6, 7, 8, 9, 0, 1, 2, 3, 4, 5]. What will the outcome be after batch normalization?
If you change 3x + 7 to 3x + 700, will the outcome change because of the different bias term?
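You can check this directly with a quick numerical sketch (plain NumPy; the helper name `batch_norm` is just the standard normalization step z_norm = (z − μ) / √(σ² + ε), not the course code, and γ/β are omitted to focus on the cancellation):

```python
import numpy as np

# Batch of 10 inputs from the example above.
x = np.array([6, 7, 8, 9, 0, 1, 2, 3, 4, 5], dtype=float)

def batch_norm(z, eps=1e-8):
    """Standard batch-norm normalization step: subtract the batch mean
    and divide by the batch standard deviation (before gamma/beta)."""
    mu = z.mean()
    var = z.var()
    return (z - mu) / np.sqrt(var + eps)

z_small_bias = 3 * x + 7    # neuron with bias 7
z_large_bias = 3 * x + 700  # same neuron with bias 700

print(batch_norm(z_small_bias))
print(batch_norm(z_large_bias))
print(np.allclose(batch_norm(z_small_bias), batch_norm(z_large_bias)))  # True
```

Both versions print the same normalized values, and the comparison prints True. The reason is that the mean μ = mean(3x) + b already contains the bias, so the subtraction z − μ removes b entirely, whether it is 7 or 700. The bias does shift the mean of Z^[l], but that shift is exactly what mean subtraction discards, which is why the bias is redundant and its job of shifting the output is handed to the learned β.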