Data augmentation for devset

saiman · September 5, 2023, 9:31am

Hello

I got a question. it seems that the best practice is to augment data only for training split.
the reason behind no doing so for other splits is said to be augmented might not represent reality.

If we don’t want it in devset because of that reason, why would we like to train our model over such data?

Regards

Deepti_Prasad · September 5, 2023, 10:14am

Hello Salman,

Sometimes acquiring and labelling additional observations can be an expensive and time-consuming process.
Data augmentation techniques are used to generate additional, synthetic data using the data you have.

Augmentation method like rescaling/cropping, flipping, noise, rotation to get larger training data and make the model generalise better.

Although in the real world data preparation is done first and then splitting the dataset. But supposedly data augmentation of test data will create additional sample data which you can avoid based on one’s choice of predictive analysis .

However one needs to understand the main significance of data augmentation is generation of additional in case you are unable to get more data, or you due to economical and time-constraint conditions at the time of prediction analysis.

Regards
DP

Topic		Replies	Views
Does Data Augmentation apply only to train data? Introduction to Machine Learning in Production	2	657	July 12, 2021
Data augmentation on validation set Convolutional Neural Networks in TensorFlow week-1	1	493	September 6, 2022
Why would we augment validation data Convolutional Neural Networks in TensorFlow week-2	1	504	July 16, 2022
Why don't we merge augmentated data and original data Convolutional Neural Networks	11	743	September 6, 2021
Is it useful to augment images in the validation set? Convolutional Neural Networks in TensorFlow week-2	7	832	January 6, 2022

Data augmentation for devset

Related topics