C2_W3_Transfer Learning

jaejun02 · April 11, 2024, 3:21pm

Hello, it was mentioned in the lecture that the data used for your own application and data used for pretrained model should be the same to make things work for training data.

I would like to ask how “same” the data should be. For example, to train a digit classifying NN, is it sufficient for the pretrained model to be trained on an image, or do they have to have a same size? I’m asking this because I was thinking about what if I have a different input dimension (especially for things like audio or text as we might extract different features)? How should I proceed with transfer learning?

Thanks.

TMosh · April 11, 2024, 3:30pm

All the images should be the same size.

jaejun02 · April 11, 2024, 3:32pm

Thanks for reply! However, if that’s a condition, wouldn’t it be too hard to used a pretrained model trained by someone else? It is quite natural that your dataset would have a different input dimension.
Or can be try adding a simple layer in front such that our input will match the input to the pretrained data after going through the layer?

Thanks!

TMosh · April 11, 2024, 3:45pm

You would need to pre-process (i.e. resize) the images before you can use them.

MINTC · April 11, 2024, 4:28pm

A similar distribution of data is necessary for transfer learning. Resize photos to fit the input scale of the pre-trained model. Make sure that the data structure for text or audio matches the expectations of the model. Adjust the model to make it more responsive to particular features using your dataset. For best results, track performance and modify the parameters as necessary.

rmwkwok · April 12, 2024, 1:31am

Hello @jaejun02,

We are multiplying matrices in neural network, and if the shape does not match, the multiplication won’t work. In principle, you may add an additional layer in the front to bridge the shapes and see how it goes, but for image, it is very common and is easier to just resize the image to the right size and the result usually isn’t bad.

Cheers,
Raymond

pastorsoto · April 12, 2024, 2:06am

Hi @jaejun02 this is a great question! I had the same question when I wanted to start using pre-trained models, there are several approach that you can take for this. It is true you need to pre-process your data to have the same size, that’s why some pre-trained models are better for some task than others, you can customize some pre-trained models to use your own data, it might be a more difficult approach but it is possible.

I hope this helps!

Topic		Replies	Views
3 Questions on transfer learning Advanced Learning Algorithms week-module-3	3	570	September 21, 2022
Transfer learning input layer size Advanced Learning Algorithms week-module-3	1	498	July 19, 2022
Transfer learning: data diversity for pre-training step Advanced Learning Algorithms week-module-3	3	474	February 26, 2023
Week 2: Transfer learning - input layer size Convolutional Neural Networks coursera-platform	1	551	October 3, 2021
C4W4A2, VGG19 model input shape Convolutional Neural Networks coursera-platform	3	528	May 27, 2023

C2_W3_Transfer Learning

Related topics