Data augmentation increases the size of the training set?

Riccardo_Andreoni · May 9, 2022, 4:47pm

Hello, I have this question.

When using data augmentation on a training set of 2000 images, will there still be just 2000 images in the training set?

Or are copies of the images created? For example, the original image is kept and a new, flipped image is added alongside it.

Thank you

m.abidat · May 9, 2022, 4:58pm

Hello @Riccardo_Andreoni, welcome to the community

As a matter of fact, the original 2000 images are left untouched, since image augmentation doesn’t require you to edit your raw images. They are loaded into memory, and the augmentation operations are applied there on the fly during training, using transforms.
As a result, the model sees more than 2000 distinct images over the course of training, without any change to your dataset.
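
To make the on-the-fly idea concrete, here is a minimal sketch in plain Python. It is not TensorFlow code: the helper names `random_flip` and `epoch_batches`, the toy 2x2 "images", the batch size of 20, and the 5 epochs are all assumptions chosen for illustration. It only shows the general pattern: every epoch draws the same 2000 stored images, applies a random transform to each one as it is loaded, and never writes anything back to the dataset.

```python
import random


def random_flip(image, rng):
    """Return a horizontally flipped copy about half of the time, else the image as-is."""
    if rng.random() < 0.5:
        return [row[::-1] for row in image]
    return image


def epoch_batches(dataset, batch_size, rng):
    """Yield augmented batches for one epoch; the stored dataset is never modified."""
    order = list(range(len(dataset)))
    rng.shuffle(order)
    for start in range(0, len(order), batch_size):
        indices = order[start:start + batch_size]
        yield [random_flip(dataset[i], rng) for i in indices]


rng = random.Random(0)

# Toy stand-in for 2000 images: each "image" is a 2x2 grid of distinct values.
dataset = [[[4 * i, 4 * i + 1], [4 * i + 2, 4 * i + 3]] for i in range(2000)]

seen = set()            # distinct image variants the "model" has been shown
samples_per_epoch = []  # how many images were processed in each epoch
batches_per_epoch = []  # how many batches were produced in each epoch

for epoch in range(5):
    n_samples = 0
    n_batches = 0
    for batch in epoch_batches(dataset, batch_size=20, rng=rng):
        n_batches += 1
        for image in batch:
            seen.add(tuple(map(tuple, image)))
            n_samples += 1
    samples_per_epoch.append(n_samples)
    batches_per_epoch.append(n_batches)

print(samples_per_epoch)         # 2000 samples in every epoch
print(batches_per_epoch)         # 100 batches in every epoch
print(len(dataset), len(seen))   # dataset size is unchanged; distinct variants seen exceeds it
```

In this sketch the per-epoch sample count and batch count never change; what grows is the number of distinct variants seen across epochs. With richer random transforms (rotations, shears, zooms), almost every draw would be a new variant.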

cvetko.tim · May 9, 2022, 6:08pm

You will gain new images by performing data augmentation. It is a powerful technique for avoiding overfitting, since you expose your model to structurally different variations of the data.

Riccardo_Andreoni · May 11, 2022, 6:30am

Thank you for the reply, it’s very clear. Just to confirm: will the model learn from both the original pictures and the transformed ones?
If so, how can I control how many transformations are performed on a single image?
I mean, what tells TensorFlow to apply just one rotation to the image instead of N rotations plus some shears?
Thank you!

VICTOR_DIAZ · February 3, 2023, 9:17am

Is there any way to calculate the number of new images generated after augmentation?

Bo_Xu · March 19, 2023, 12:32am

Same question: how many new examples are generated by augmentation? The progress output of model_for_aug.fit(…) shows 100/100 at the end of each epoch. So it appears that the number of batches is still 100, the same as when augmentation is not used. Either TensorFlow adds extra batches behind the scenes that are not reflected in the 100 count, or it increases the batch size so that each batch includes more than 20 examples. Which is the case?

fdam · March 25, 2023, 8:56pm

This is also my question (I just opened a topic for this…)

How many images do I have after augmentation?
