My code is correct, but I don’t understand

Jingni_Huang · October 25, 2023, 10:32pm

week 1, second Lab
For Keira’s.layer,
Why here use tfl.ZeroPadding2D, not ZeroPadding3D? I know that on the Tensoflow web mention it: 2D is image. But our input is (64, 64, 3), if I don’t misunderstand, 64, 64 are the image height and width, 3shall be channels. Why we don’t say The Who object is 3D?
And how come here padding 3, not 4,5,6?
However, the BatchNomilization uses axis = 3, 2D has axis 3?

balaji.ambresh · October 26, 2023, 3:26am

2D convolution operations (conv2d, *pooling2d) come from the fact that you provide details about width and height. The 3rd dimension on number of channels is inferred from the input.

As far as batch normalization is concerned, the exact axis over which batch normalization has to be performed should be specified and hence the use of axis=3.

paulinpaloalto · October 26, 2023, 3:43am

Yes, the 2D versus 3D is referring to the “spatial” dimensions of the input values. For the 2D case, you have 2 spatial dimensions: height and width, e.g. for an image, and then the third dimension is “channels”, which would typically be the color values of the pixels in the image case. But you can also have images with 3 spatial dimensions: consider the case of CT scans or other medical scans which have 3 spatial dimensions: height, width and depth. And then you have a “channels” dimension that will give the values of the sensors corresponding to the point in 3D space. So the tensors will be 4D, but with 3 spatial dimensions. In that case, you would use Conv3D or Pooling3D.

rmwkwok · October 26, 2023, 4:05am

Besides all the comments above, I would also recommend you to

In the tensorflow documentation about ZeroPadding2D, read the whole page and especially the Input shape section. The page tells us what it does, while the input shape section tells us what it expects for.
In the assignment, print some shapes of our input - X_train. It is definitely not (64, 64, 3).

Good luck!
Raymond

Topic		Replies	Views
Week 1 - Convolution_model_Application (Understanding) Convolutional Neural Networks coursera-platform	2	556	July 24, 2022
Helpful Advice for Week 1 Assignment 2 Exercise 1 Convolutional Neural Networks coursera-platform	11	1622	July 12, 2024
Course 4, Week 1, Assignment 2: Why is batch normalization applied (only) to axis 3? Convolutional Neural Networks coursera-platform	2	525	May 10, 2022
C4,W1,A2 E1 BatchNormalization for axis 3 Convolutional Neural Networks coursera-platform	5	543	November 29, 2022
TF batch norm for CNNs question Convolutional Neural Networks coursera-platform	5	479	May 24, 2023

My code is correct, but I don’t understand

Related topics