Course 4[Week 1] Convolutions Over Volume - Output image Size

pushkarp6 · May 21, 2021, 7:57am

Well, as Andrew taught in the Convolutions over Volume video that we get 2d output for a convoluted RGB image considering a single filter.
i.e. 6x6x3 * 3x3x3 = 4x4
Why can’t the output be 4x4x3?

Viktoriia · May 21, 2021, 11:45am

Hello @pushkarp6

This is how the convolution over volume is defined.
In this case you have 6x6 image with 3 layers (r,g,b) and you are using 3x3 filter with 3 identical layers thus summing on each step 3x3x3 = 27 number yielding a flat 4x4 image

you can define a new operation and call it differently where on each step you would sum 9 values 3 times keeping three layers. the only question is why ?

hth

kushaldev · May 21, 2021, 12:03pm

Hii @pushkarp6 ,
Basically reason behind doing these convolutions is to achieve main goal of extracting features and intuitvely when we will combine the representations over all the 3 channels than only we can get best representation of the extracted feature becuse in general analogy with humans also, if you see any image than you don’t observe its features on different scales but you do as a whole to build abstract idea of patterns, colors etc. That’s my intuition! Also mathematically, @Viktoriia reply is perfect and thought provoking!

pushkarp6 · May 22, 2021, 6:38am

@Viktoriia Thank you so much for the explanation!!
The last statement is exactly where I am stuck

pushkarp6 · May 22, 2021, 6:39am

@kushaldev Great explanation.
Thank you so much!!!

Viktoriia · May 22, 2021, 8:24am

@pushkarp6 my pleasure! You are welcome! Good luck for the rest exercises.

kushaldev · May 22, 2021, 11:11am

@pushkarp6 always a pleasure to think and discuss about possibilities and ideas!

Topic		Replies	Views
Course 4 Week 4 Quiz: Possible error in quiz Convolutional Neural Networks	2	547	August 29, 2022
Question in week 4 3D Convolution Convolutional Neural Networks	4	305	November 5, 2023
W4_Quiz_3D Convolution Convolutional Neural Networks	6	1452	August 7, 2022
In start they say 6x6x3 image 3 is color channel but now they are telling 37x37x40 40 is filter size. anybody can elaborate this Convolutional Neural Networks	3	497	December 20, 2022
Course4,Week4,quiz Convolutional Neural Networks	1	554	January 23, 2022

Course 4[Week 1] Convolutions Over Volume - Output image Size

Related topics