Week 2, Assigment 1, Convolutional block (exercise 2)

Joao_Marcelo_Farias · February 12, 2023, 5:54pm

Hi. I’m making the exercise that is on title. I searched for answers but I couldn’t find one.

[snippet deleted by mentor]

The error I’m receiving says that my the values inside my output are wrong when training=False, although I don’t know if it will miss when it be true too.
Here is some of my code (I will make ):

normalization = BatchNormalization(axis=3)
relu = Activation('relu')
X = Conv2D(filter=F2, kernel_size=f, strides=1, padding='same', kernel_initializer=initializer(seed=0))(X)
X = normalization(X, training=training)
X = relu(X)

X = Conv2D(filters=F3, kernel_size=1, strides=1, kernel_initializer=initializer(seed=0))(X)
X = normalization(X, training=training)

X_shortcut = Conv2D(filters=F3, kernel_size=1, strides=s, kernel_initializer=initializer(seed=0))(X_shortcut)
X_shortcut = normalization(X_shortcut, training=training)

I didn’t put the code that I didn’t write

balaji.ambresh · February 12, 2023, 6:05pm

BatchNormalization layer shouldn’t be shared across 2nd and 3rd components.

paulinpaloalto · February 12, 2023, 6:06pm

There is no function called normalization in the notebook as given to us, so that must be something you created. Perhaps you didn’t define it correctly. The instructions tell us to use the BatchNormalization function from TF and they even give us examples of how to call it in the template code. Your calls should look the same as those, although in some cases the input tensor is different.

Joao_Marcelo_Farias · February 12, 2023, 6:22pm

It’s the name of the variable with value BatchNormalization(axis=3)

Joao_Marcelo_Farias · February 12, 2023, 6:23pm

Like, doesn’t make a variable to be BatchNormalization(axis=3)?

paulinpaloalto · February 12, 2023, 6:37pm

Ok that should be correct, if you specified the axis correctly. Notice that you’ve written it two different ways:

In the first post you show axis = True.

In the later ones you show axis = 3.

Please check that’s really 3.

balaji.ambresh · February 12, 2023, 6:38pm

I was the one who removed your image from the public post since you’re not allowed to share code in public.

As far as batch normalization is concerned, a new layer instance should be created. A layer is created when using something like Dense(...). When you call the layer with the parameters, like Dense(...)(...), the __call__ method is invoked.

One more mistake is that the shortcut path should not contain batch normalization after adding the intermediate results. Please see the image in the markdown above the exercise

Joao_Marcelo_Farias · February 12, 2023, 6:48pm

Thank you! It worked. Sorry for sending the image

Topic		Replies	Views
Course 4 week 2 CNN Convolutional Neural Networks	2	653	June 6, 2021
Week 2 - Exercise 2 - convolutional_block - ValueError: Operands could not be broadcast together with shapes (1, 1, 2) (4, 4, 3) Convolutional Neural Networks	8	986	September 17, 2021
Week 2 Assignment 1 Exercise 2 Convolutional block Convolutional Neural Networks	4	511	November 7, 2022
Week 2 , programming assignment Convolutional Neural Networks	3	673	January 16, 2022
Error in Identity block in week 2 Convolutional Neural Networks	4	670	July 13, 2021

Week 2, Assigment 1, Convolutional block (exercise 2)

Related topics