Fine tuning and further pretrained model training when batchnorm layers are blocked

Nanini · January 16, 2024, 12:28am

Sequence Models, course 4, week 3, the last assignment: Trigger_word_detection_v2a

In the section where we fit the model, the notebook says that when we fine-tune a pretrained model, we block the weights of the batchnorm layers so that they are no longer trainable. It then says that we can continue to train the model using the Adam optimizer and binary cross-entropy loss.

My question is: are fine-tuning and continuing to train a pretrained model considered the same thing here, or in general? And since the batchnorm layers of the pretrained model are not trainable (because we blocked them), how is the model actually able to train?

Thank you very much

Best

TMosh · January 16, 2024, 1:50am

Yes.

Typically you'll make the early layers not trainable and fine-tune only the final layers, so the model still learns through the layers that remain trainable.
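To make the "frozen layers, trainable layers" idea concrete, here is a minimal dependency-free sketch. It is not the assignment's code: the toy data, the parameter names (`gamma`, `beta`, `w`, `b`) and the `trainable` dictionary are all made up for illustration, and it uses plain gradient descent rather than Adam to stay short. `gamma` and `beta` stand in for a blocked batchnorm layer's scale and shift, and `w` and `b` stand in for a later layer that is still being fine-tuned. In Keras, the equivalent of the `trainable` flags is setting `layer.trainable = False` on the layers you want to block.

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def bce(y, p, eps=1e-12):
    # Binary cross-entropy for one example.
    return -(y * math.log(p + eps) + (1 - y) * math.log(1 - p + eps))

# Hypothetical toy data: the label is 1 when x > 0.
xs = [-2.0, -1.0, -0.5, 0.5, 1.0, 2.0]
ys = [0, 0, 0, 1, 1, 1]

# Stand-in "pretrained" parameters (values are arbitrary assumptions).
params = {"gamma": 1.5, "beta": 0.1, "w": 0.2, "b": 0.0}
# gamma/beta are blocked, like the batchnorm layers; w/b stay trainable.
trainable = {"gamma": False, "beta": False, "w": True, "b": True}

def forward(x, p):
    h = p["gamma"] * x + p["beta"]          # frozen scale-and-shift
    return sigmoid(p["w"] * h + p["b"]), h  # trainable output unit

def mean_loss(p):
    return sum(bce(y, forward(x, p)[0]) for x, y in zip(xs, ys)) / len(xs)

def train(p, steps=200, lr=0.5):
    p = dict(p)  # work on a copy so the original values can be compared
    n = len(xs)
    for _ in range(steps):
        grads = {k: 0.0 for k in p}
        for x, y in zip(xs, ys):
            pred, h = forward(x, p)
            dz = pred - y  # gradient of BCE w.r.t. the logit of a sigmoid
            grads["w"] += dz * h / n
            grads["b"] += dz / n
            grads["gamma"] += dz * p["w"] * x / n
            grads["beta"] += dz * p["w"] / n
        for k in p:
            if trainable[k]:  # blocked parameters are never updated
                p[k] -= lr * grads[k]
    return p

loss_before = mean_loss(params)
tuned = train(params)
loss_after = mean_loss(tuned)
print("loss before:", loss_before, "loss after:", loss_after)
print("gamma unchanged:", tuned["gamma"] == params["gamma"])
```

Gradients are still computed for the blocked parameters, but the update step skips them, so the loss goes down only through changes to `w` and `b`. That is the sense in which a model with blocked batchnorm layers can still be trained further.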
