In the week 2 Transfer Learning assignment I understand the logic behind freezing the earlier layers and unfreezing the last “X” layers so that the network can pick up on your domain’s intricacies.
What I don’t really understand is the second part of this statement from the Transfer Learning notebook: “Set training in base_model to False to avoid keeping track of statistics in the batch norm layer”.
My understanding is that setting training in ‘base_model’ to False avoids changing the pretrained weights and only trains the new layers, which makes sense. It’s the last part, about “keeping track of statistics in the batch norm layer”, where I am lost.
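For context, here is roughly the pattern I’m asking about (a sketch from memory, so the exact input shape and head layers may differ from the notebook):

```python
import tensorflow as tf

# Pretrained base with its original classification head removed
base_model = tf.keras.applications.MobileNetV2(input_shape=(160, 160, 3),
                                               include_top=False,
                                               weights='imagenet')
base_model.trainable = False  # freeze the pretrained weights

inputs = tf.keras.Input(shape=(160, 160, 3))
# This is the call the quoted comment refers to
x = base_model(inputs, training=False)
x = tf.keras.layers.GlobalAveragePooling2D()(x)
outputs = tf.keras.layers.Dense(1)(x)  # new trainable head
model = tf.keras.Model(inputs, outputs)
```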
Any help in understanding this would be greatly appreciated!