Hello, Prof. Andrew Ng said that "unless you have a large dataset and a large computational margin," we shouldn't train the network from scratch. My question is: for what range of dataset sizes is transfer learning applicable? Thank you!
Hi Michelle,
Thanks for bringing up this interesting question.
I do not have an exact answer for that. My understanding of transfer learning is this:
Transfer learning is usually done for tasks where your dataset has too little data to train a full-scale model from scratch.
So instead of looking for a fixed range or number, which will differ depending on the data type and the application, I would look at the motivation in terms of performance and re-using computational resources. Of course, the baseline model you want to use as a reference needs to have been trained on a big dataset, but how big does it need to be? You have to study each use case separately: for instance, thousands of pictures might be enough for one specific type of classifier but not for another.
Please do read this article about the topic:
Best,
Rosa
Another reason for transfer learning is that there exist highly developed, well-performing models, trained by experts on massive datasets, but they may output the wrong shape. Say a model was trained on ImageNet and handles 1,000 classes, but you only want to classify images as dog or cat. You can take an existing model, such as ResNet or MobileNet, and modify only the layer producing the output shape. This vastly reduces the number of parameters to train. I think the idea is: "Google trained MobileNet for us, let's try to use as much of it as we can."
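Here is a minimal sketch of that idea in TensorFlow/Keras, assuming the usual 224×224 input; the new head and the training settings are illustrative choices, not a tuned recipe:

```python
import tensorflow as tf

# Load MobileNetV2 pretrained on ImageNet, without its 1,000-class output layer.
base = tf.keras.applications.MobileNetV2(
    input_shape=(224, 224, 3),
    include_top=False,       # drop the original classification head
    weights="imagenet",
    pooling="avg",           # global average pooling in place of the head
)
base.trainable = False       # freeze the pretrained feature extractor

# Attach a new output layer shaped for the 2-class dog-vs-cat task.
model = tf.keras.Sequential([
    base,
    tf.keras.layers.Dense(1, activation="sigmoid"),  # only this layer trains
])

model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.summary()  # compare trainable vs. total parameters
```

With the base frozen, only the weights of the final Dense layer are updated during training, which is exactly why a small dataset can be enough here.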