I’ve already learned that if my final target NN is Z, and a well-trained NN A is strongly related to target Z, then I can freeze most of the layers in A and only train the last few layers of A for Z. That’s how we inherit A’s “knowledge”; this is one-to-one (A transfers to Z) transfer learning.
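For concreteness, here is a minimal sketch of what I mean by that, assuming A is a saved Keras model; the file name and the “last two layers” choice are just placeholders:

```python
from tensorflow import keras

# Hypothetical pretrained model A (file name is a placeholder).
nn_a = keras.models.load_model("nn_a.h5")

# Freeze everything except the last few layers, then retrain only those for target Z.
for layer in nn_a.layers[:-2]:
    layer.trainable = False

nn_a.compile(optimizer="adam",
             loss="sparse_categorical_crossentropy",
             metrics=["accuracy"])
# nn_a.fit(z_train_x, z_train_y, epochs=5)  # retrain only the unfrozen layers on Z's data
```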
Now, my question is this:
Almost everything is the same, but I want to inherit or transfer from 2 well-trained NNs (let’s say A and B) into Z. What should I do about it? In other words, this transfer learning is not one-to-one but many-to-one (A, B to Z). Could anyone give me some hints, or at least some keywords I could google?
It’s a tricky process, and I am not sure it can be done! In the MLOps specialization they introduce some transfer learning techniques like Teacher-Student models; I think it’s in course 4, go and have a look at it.
But as for fusing many networks into one, I don’t know if it can be done, because how would you merge the different architectures and their different parameters together?
That’s the reason I am asking. I have already done that class; that’s how I know about one-to-one transfer learning.
I believe there must be a way to do this. Maybe not the same way as before, where you only train the last few layers, but some new strategy.
May I ask: if I have well-trained NNs A and B, and I just want to “combine” them together to become my new NN Z (Z = A “+” B), is there a way to do that?
Maybe I can rephrase and split my question into 2 stages.
Stage 1: for example, suppose NN A can classify cats and dogs very well, and another NN B can classify human males and females very well. A and B are well trained, and their network structures are even the same.
Now, my new mission is to classify among cat, dog, male, and female. Is there a way I can do it through A and B? That’s what I meant: is there a way to “combine” A and B, or what are the strategies for doing this? Of course, I am not talking about retraining everything. Let’s assume I just downloaded NNs A and B from somewhere, and the source doesn’t give me the training samples.
Stage 2: now I want to go to my final target and add one more class, maybe alien bugs. In other words, this final NN Z can classify 5 classes in total (cat, dog, human male, female, and alien bug). Here, I do assume I have a few training samples for the alien bugs. As you can tell, if I can do stage 1, then stage 2 becomes standard transfer learning: I just train the last few layers of the “combined A+B”.
Concatenate? Hmm, looks so easy, lol. Sounds like I just concatenate the layers of nn_a and nn_b together, and that’s it? So this nn_a + nn_b will truly do the work of classifying cat, dog, male, and female very well? Thank you.
Essentially, that shows three models being merged using Concatenate; the code is available at the linked page. In Keras, the Model class inherits from (extends) the Layer class, so you can treat Model instances like Layer instances to do things like this. In object-oriented lingo, Model is-a Layer. Note that the inverse relationship is not true: you cannot say Layer is-a Model. In the example on that page, the author just uses Layers directly, but it’s a trivial extension to make each one a Model, using either the Sequential or Functional API, and then use the Functional API to Concatenate them. The usual restrictions on freezing parameters apply if you’re subsequently doing further training on a merged architecture.
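As a rough sketch of the “Model is-a Layer” idea, here is one way it might look; the three branches and their shapes are made up purely for illustration and are not taken from the linked example:

```python
from tensorflow import keras
from tensorflow.keras import layers

# Build three small standalone Models (architectures here are invented for illustration).
def make_branch(name):
    inp = keras.Input(shape=(32,))
    x = layers.Dense(16, activation="relu")(inp)
    out = layers.Dense(8, activation="relu")(x)
    return keras.Model(inp, out, name=name)

branch_a = make_branch("branch_a")
branch_b = make_branch("branch_b")
branch_c = make_branch("branch_c")

# Because a Model can be called like a Layer, each branch is applied to its own input
# and the three outputs are merged with Concatenate in the Functional API.
in_a = keras.Input(shape=(32,))
in_b = keras.Input(shape=(32,))
in_c = keras.Input(shape=(32,))
merged = layers.Concatenate()([branch_a(in_a), branch_b(in_b), branch_c(in_c)])
out = layers.Dense(4, activation="softmax")(merged)

combined = keras.Model([in_a, in_b, in_c], out)
combined.summary()
```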
Transfer learning generally involves freezing some or all layers of the existing model so that you don’t lose its training when it is incorporated into a new model. In the example linked above, all three channels are trained on their respective inputs before being concatenated, so it isn’t a transfer learning example. To adapt it to your hypothetical, each channel would be trained as a separate model, loaded as 3 separate models, have its trainable parameters frozen, and then be concatenated. Read more about Keras models and transfer learning here:
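To make that concrete, here is a hedged sketch of what such an adaptation might look like for your cat/dog and male/female case; the file names, input shape, and head sizes are all assumptions, not something from the linked page:

```python
from tensorflow import keras
from tensorflow.keras import layers

# Hypothetical pretrained models A and B (file names are placeholders).
nn_a = keras.models.load_model("cat_dog_model.h5")
nn_b = keras.models.load_model("male_female_model.h5")

# Freeze both models so their weights are not updated during further training.
nn_a.trainable = False
nn_b.trainable = False

# Feed the same image to both frozen models, concatenate their outputs,
# and learn a small new head that maps the merged features to the new classes.
inp = keras.Input(shape=(128, 128, 3))  # assumed input shape shared by A and B
merged = layers.Concatenate()([nn_a(inp, training=False), nn_b(inp, training=False)])
x = layers.Dense(32, activation="relu")(merged)
out = layers.Dense(5, activation="softmax")(x)  # cat, dog, male, female, alien bug

nn_z = keras.Model(inp, out)
nn_z.compile(optimizer="adam",
             loss="sparse_categorical_crossentropy",
             metrics=["accuracy"])
# nn_z.fit(...)  # only the new Dense head gets trained
```

One design note: this sketch concatenates the final outputs of A and B; in practice you may get better features by tapping a layer before each model’s softmax instead, but the freezing-then-concatenating pattern is the same.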
The key idea is this: “Layers & models also feature a boolean attribute trainable. Its value can be changed.”
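As a tiny illustration of that flag (toy layer sizes, only to show the effect on trainable_weights):

```python
from tensorflow import keras
from tensorflow.keras import layers

model = keras.Sequential([keras.Input(shape=(3,)),
                          layers.Dense(4),
                          layers.Dense(2)])

print(len(model.trainable_weights))  # 4: kernel + bias for each of the two Dense layers

model.layers[0].trainable = False    # freeze just the first Dense layer
print(len(model.trainable_weights))  # 2: only the second layer's kernel + bias remain

model.trainable = False              # or freeze the whole model at once
print(len(model.trainable_weights))  # 0
```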