Exploring augmentation with horses vs. humans

Fawaz_Hussain_Ghamdi · December 7, 2022, 3:55am

In the video for this topic, the instructor said:

“If the validation set doesn’t have the same randomness, then its results can fluctuate like this. So bear in mind that you don’t just need a broad set of images for training, you also need them for testing, or the image augmentation won’t help you very much.”

I don’t understand why that would make a difference in the accuracy on the validation set.

I would understand if results on real-world examples differed from what we get on the validation set because the validation set lacks variance.

I hope someone can clear this up a little bit more for me.

Thank you in advance.

balaji.ambresh · December 7, 2022, 8:03am

When the distribution of the validation dataset differs from that of the training dataset, the training pipeline needs to be inspected.
Image augmentation can help broaden the distribution of the training dataset.

There are two cases where image augmentation alone is not enough and you need more images:

1. The augmentation of the training data is insufficient and doesn’t fully cover the distribution of the validation data. Adding more training examples or more augmentation takes care of this.
2. The validation set might be poorly designed. For instance, if all humans in the validation set are standing, and the augmented training images make the standing posture hard to capture, validation accuracy might vary across epochs until the model has learnt to classify a standing human from images in their augmented form. The situation improves if the validation set contains more images that look like the augmented training images.
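To make the mismatch concrete, here is a minimal sketch in plain Python. It is not the course's Keras code: the helper names (`hflip`, `shift_right`, `augment`, `training_batch`, `validation_batch`) and the tiny list-of-rows "images" are assumptions made up for illustration. It shows the usual setup in which training images get a fresh random transform on every pass while validation images are passed through unchanged, so the model trains on a broader distribution than the one it is scored on.

```python
import random


def hflip(img):
    """Mirror a 2-D image (a list of rows) left to right."""
    return [row[::-1] for row in img]


def shift_right(img, k, fill=0):
    """Shift every row k pixels to the right, padding the left with `fill`."""
    out = []
    for row in img:
        s = min(k, len(row))  # never shift by more than the row width
        out.append([fill] * s + row[:len(row) - s])
    return out


def augment(img, rng, max_shift=1):
    """Hypothetical augmentation: random horizontal flip plus a random shift."""
    out = hflip(img) if rng.random() < 0.5 else [row[:] for row in img]
    return shift_right(out, rng.randint(0, max_shift))


def training_batch(images, rng):
    """Training images are re-augmented on every call (every epoch)."""
    return [augment(img, rng) for img in images]


def validation_batch(images):
    """Validation images are only copied, never augmented."""
    return [[row[:] for row in img] for img in images]


def distinct_variants(img, epochs, rng):
    """Count how many different versions of one image the model would see."""
    return len({tuple(map(tuple, augment(img, rng))) for _ in range(epochs)})


if __name__ == "__main__":
    img = [[1, 2, 3], [4, 5, 6]]
    rng = random.Random(0)
    print("variants seen in training:", distinct_variants(img, 200, rng))
    print("validation image unchanged:", validation_batch([img])[0] == img)
```

With these two transforms a single training image can appear in up to four different forms, while the validation set only ever contains the original form. If the validation set holds few images, and all of them sit in one narrow part of that space (for example, every human standing upright), the score can swing from epoch to epoch as the model's fit to that narrow region changes.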
