How small is the 'small training set'?

Christian_Simonis · April 24, 2023, 4:34pm

Welcome to the community, @Adilbek_Salimgereyev: Good question!

In addition to @gent.spah‘s great reply:
How much data is needed to solve a specific problem can be quite challenging to determine in the conflict of interest between data acquisition cost and technical excellence. One way to quantity the (expected) information e.g. via (Shannon) entropy approaches can be active learning (AL) where model uncertainty can be utilized, see also: How much data does a CNN need to learn? - #2 by Christian_Simonis

So, Active learning can help to quantify:

which label is expected to provide a valuable benefit and also
when a sufficient amount of data has been used to train your model.

This thread on the batch could be interesting for you if you are interested in AL.
Many Thanks, @saifkhanengr, for the hint!

Best regards
Christian

Topic		Replies	Views
W1_Quiz_Large NN Models vs Traditional Learning Neural Networks and Deep Learning	3	658	February 5, 2023
Data Set Size for DL Structuring Machine Learning Projects	2	550	April 27, 2022
Week 1 Quiz question - re: smaller training sets Convolutional Neural Networks	4	825	January 14, 2025
How much images to collect for dataset to train a pre-trained model Neural Networks and Deep Learning	2	644	March 5, 2023
Making a training set Neural Networks and Deep Learning	5	494	August 27, 2023

How small is the 'small training set'?

Related topics