Course 2 Week 3, Question on Batch Normalization addressing Covariate Shift

Hi, in the introduction to Batch Normalization the professor mentions how a classifier trained to recognize cats would not do very well if it was trained on black cats and then tested on colored cats. I believe Batch Normalization can help with this by normalizing each layer's inputs, so that shifts in the input distribution have less effect on the later layers.

However, in the video on how to determine the final mean/variance to scale test data by, the professor mentions using an exponentially weighted average of the mean/variance of the mini-batches. This makes sense if the test data comes from the same distribution as the training data. I was wondering, however, whether this method of obtaining the final mean/variance would still work if the data were like the cat example (training = black cats, test = colored cats). My intuition, if I were to apply this model to the real world, would be to collect colored-cat data, re-estimate the mean/variance from it, and use those at test time; that should let the model transfer over much better (a rough sketch of what I mean is below). Is that something that is done in practice?
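To make my intuition concrete, something like this is what I'm picturing (purely illustrative; the function and variable names are made up):

```python
import numpy as np

def reestimate_stats(new_distribution_data):
    """Re-estimate normalization statistics from data drawn from the
    new (test-time) distribution, e.g. colored cats, instead of reusing
    the running averages computed on black cats."""
    mu = new_distribution_data.mean(axis=0)   # per-feature mean
    var = new_distribution_data.var(axis=0)   # per-feature variance
    return mu, var

# colored_cat_activations: activations collected by running the trained
# network on a sample of colored-cat images (hypothetical array).
# mu_new, var_new = reestimate_stats(colored_cat_activations)
# ...then normalize test inputs with mu_new / var_new at inference time.
```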

Hey @daHsu,
Welcome to the community. Prof Andrew explains why Batch Normalization helps in the case of Covariate Shift in the video entitled “Why does Batch Norm work?”.

Here, when I say “batch normalization”, I also include the exponentially weighted averages of the mini-batch statistics that are used to estimate \mu and \sigma^2 at test time (note that \gamma and \beta are parameters learned during training, not averaged). I would suggest reviewing the video once again. As a minimal NumPy sketch of the idea (just illustrative, not the course's implementation, and the function names are my own):
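```python
import numpy as np

def batchnorm_train_step(x, gamma, beta, running_mean, running_var,
                         momentum=0.9, eps=1e-5):
    """Training-time batch norm on a mini-batch x of shape (m, n_features).

    running_mean/running_var are the exponentially weighted averages of
    the mini-batch statistics; they are updated in place and later used
    at test time in place of the mini-batch statistics.
    """
    mu = x.mean(axis=0)    # mini-batch mean, per feature
    var = x.var(axis=0)    # mini-batch variance, per feature

    # Exponentially weighted averages of the mini-batch statistics
    running_mean[:] = momentum * running_mean + (1 - momentum) * mu
    running_var[:]  = momentum * running_var  + (1 - momentum) * var

    x_norm = (x - mu) / np.sqrt(var + eps)
    return gamma * x_norm + beta   # gamma/beta are learned, not averaged

def batchnorm_test(x, gamma, beta, running_mean, running_var, eps=1e-5):
    """Test-time batch norm: normalize with the stored running averages."""
    x_norm = (x - running_mean) / np.sqrt(running_var + eps)
    return gamma * x_norm + beta
```

I hope this helps.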

Regards,
Elemento