I’m sure I’m missing something obvious here, but I’ve been beating my head against the wall (metaphorically) for a couple of weeks on this.
I understand transfer learning; I’ve not only used it in the past but have also taught it (using MATLAB, on the AlexNet model).
I found another post on this topic, but the discussion there did not enlighten me.
The previous use of the augmenter required iteration over a set of images. Clearly, we want to apply it only to the training images, yet there does not appear to be any partitioning of the inputs into training, validation, and test sets in the function parameters.
Does application of the augmenter require iteration over each image? Is the augmentation somehow suppressed automatically during inference and applied only during training? Should the right-hand side of the `x = ` assignment be a list comprehension?
As I said earlier, I’m convinced I’m missing something obvious.
See section 1.1 where `train_dataset` and `validation_dataset` are defined. Since we provide the same parameters except `subset`, it’s the same as creating non-overlapping training and validation sets with an 80%/20% split. `image_dataset_from_directory` is responsible for iterating over the images and providing input to the caller.
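For concreteness, here is a minimal sketch of what that section does (the directory path, image size, and seed are placeholders, not the assignment’s exact values):

```python
import tensorflow as tf

BATCH_SIZE = 32
IMG_SIZE = (160, 160)
directory = "dataset/"  # hypothetical path

train_dataset = tf.keras.utils.image_dataset_from_directory(
    directory,
    shuffle=True,
    batch_size=BATCH_SIZE,
    image_size=IMG_SIZE,
    validation_split=0.2,  # hold out 20% of the images
    subset="training",     # this call returns the 80% training split
    seed=42,               # same seed => the two splits do not overlap
)

validation_dataset = tf.keras.utils.image_dataset_from_directory(
    directory,
    shuffle=True,
    batch_size=BATCH_SIZE,
    image_size=IMG_SIZE,
    validation_split=0.2,
    subset="validation",   # this call returns the remaining 20%
    seed=42,
)
```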
`model.fit` will internally call the augmentation layers, and these layers perform their task only while training (see RandomFlip):

> A preprocessing layer which randomly flips images during training.

When the model is not training, these augmentation layers let the input pass through without any modification. One way to control the training state of the model is to use `tf.keras.backend.set_learning_phase`.
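A quick way to observe this behaviour (a sketch using a dummy tensor in place of a real image; `model.fit` passes `training=True` for you, but a direct call lets you toggle it explicitly):

```python
import tensorflow as tf

data_augmentation = tf.keras.Sequential([
    tf.keras.layers.RandomFlip("horizontal"),
    tf.keras.layers.RandomRotation(0.2),
])

image = tf.random.uniform((1, 160, 160, 3))  # dummy batch of one image

augmented = data_augmentation(image, training=True)     # random flip/rotation applied
passthrough = data_augmentation(image, training=False)  # identity: input passes through unchanged
```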
Thank you, balaji_ambresh, for responding to my question regarding whether the augmentation is applied during inference.
However, I’m still stuck! Do I iterate over all elements of the input and apply the data augmenter? Would I use a list comprehension for this, or an explicit for loop? Do I need to call `expand_dims` before applying the data augmenter? (I assume so, because that’s what was done in the example.)
The comment:

`# apply data augmentation to the inputs`

is puzzling, as no input is provided to the function. I realize we’re specifying a model here, but how do I indicate what the input is?
A batch of input has shape `[batch_dim, height, width, channels]`. If you’re manually feeding a single image, use `expand_dims` to add a dummy dimension before feeding it to a layer. Vectorization and batching play an important role in deep learning libraries, so you don’t have to feed a single image at a time to a NN.
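For example, with a dummy tensor standing in for a real image:

```python
import tensorflow as tf

image = tf.random.uniform((160, 160, 3))  # a single image: [height, width, channels]
batch = tf.expand_dims(image, axis=0)     # add the batch dimension: [1, height, width, channels]
print(batch.shape)                        # (1, 160, 160, 3)
```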
Shifting attention to `alpaca_model`, observe the `input_shape` parameter of the `base_model`. Though this reflects the shape of a single example, TensorFlow internally performs mini-batching when `model.fit` is called, according to its `batch_size` parameter. Providing a `Dataset` or arrays of Xs and Ys is sufficient.
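As a sketch (reusing the datasets from the earlier snippet, with a trivial stand-in model rather than the assignment’s `alpaca_model`):

```python
import tensorflow as tf

# Trivial stand-in; in the assignment this would be alpaca_model(...).
model = tf.keras.Sequential([
    tf.keras.layers.Rescaling(1.0 / 255, input_shape=(160, 160, 3)),
    tf.keras.layers.GlobalAveragePooling2D(),
    tf.keras.layers.Dense(1),
])
model.compile(optimizer="adam",
              loss=tf.keras.losses.BinaryCrossentropy(from_logits=True),
              metrics=["accuracy"])

# model.fit iterates over the batched Dataset itself; no manual loop,
# no list comprehension, and no per-image expand_dims is needed.
history = model.fit(train_dataset, validation_data=validation_dataset, epochs=5)
```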
See this link on transfer learning. Notice how a single image is fed to the model using `tf.expand_dims` as shown here. Look at the section on adding a classification head, where the `data_augmentation` Sequential model is invoked without worrying about the batch dimension.
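Putting it together, a minimal sketch of that functional-API pattern (the pooling layer is a stand-in for the MobileNetV2 base model, not the assignment’s actual architecture):

```python
import tensorflow as tf

IMG_SHAPE = (160, 160, 3)

data_augmentation = tf.keras.Sequential([
    tf.keras.layers.RandomFlip("horizontal"),
    tf.keras.layers.RandomRotation(0.2),
])

inputs = tf.keras.Input(shape=IMG_SHAPE)   # symbolic placeholder; batch dim is implicit
x = data_augmentation(inputs)              # the "x = " line: wires the augmenter into the graph
x = tf.keras.layers.GlobalAveragePooling2D()(x)  # stand-in for the base model
outputs = tf.keras.layers.Dense(1)(x)
model = tf.keras.Model(inputs, outputs)
```

No loop or list comprehension is needed: `inputs` is a symbolic tensor, not a concrete image, and Keras applies the augmenter batch-wise at training time.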