Question about Course 4 <Inception network motivation>

Luoli_Wang · January 29, 2024, 6:55am

In Course 4, Week 2, "Inception network motivation", the lecture mentions that when you apply max-pooling to a 28x28x192 input, you end up with a 28x28x32 output. How is this possible? I thought max-pooling is applied to each input channel independently, which means the number of output channels should equal the number of input channels (192). Am I misunderstanding the idea of max-pooling, or is it just a typo? Thank you in advance for clarifying this!

hackyon · January 29, 2024, 1:12pm

Great catch. You are correct in that max-pooling should not change the number of channels.

I think part of the video omitted a step; see this post on Stack Overflow. Basically, a 1x1 conv is still applied after the max-pooling to reduce the number of channels.

I’ve checked over the original paper to verify that the answer on stackoverflow is indeed correct.
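
To make the two-step explanation concrete, here is a minimal shape-tracing sketch in Python. The helper names (`max_pool_shape`, `conv_shape`) are hypothetical and only do shape bookkeeping; the 3x3 window with padding 1 and stride 1 is an assumption chosen so the spatial size stays 28x28, and the 32 filters match the output depth quoted from the lecture.

```python
def _out_size(n, f, p, s):
    # Standard spatial output size: floor((n + 2p - f) / s) + 1
    return (n + 2 * p - f) // s + 1

def max_pool_shape(h, w, c, f=3, p=1, s=1):
    # Pooling acts on each channel separately, so c passes through unchanged.
    return _out_size(h, f, p, s), _out_size(w, f, p, s), c

def conv_shape(h, w, c, n_filters, f=1, p=0, s=1):
    # Each filter spans all c input channels and yields one output channel.
    return _out_size(h, f, p, s), _out_size(w, f, p, s), n_filters

pooled = max_pool_shape(28, 28, 192)           # still 192 channels
projected = conv_shape(*pooled, n_filters=32)  # 1x1 conv reduces the depth
print(pooled, projected)
```

Under these assumptions the pooling step leaves the depth at 192, and only the 1x1 convolution brings it down to 32.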

ngkhatu · January 29, 2024, 8:06pm

I believe this was clarified at 2:10 in the video. I also have in my notes that we need to use padding to match dimensions.

Either of these seems to work using the formula ((n + 2p - f) / s) + 1:

n=28, p=0, f=1, s=1, nc= 32 → 28 x 28 x 32
n=28, p=1, f=3, s=1, nc=32 → 28 x 28 x 32
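
As a quick check of the arithmetic above, the formula can be written as a small Python function. The name `output_size` is hypothetical, and integer (floor) division is assumed for cases where the stride does not divide evenly. Note that the formula only gives the spatial size; it says nothing about the number of channels.

```python
def output_size(n: int, f: int, p: int = 0, s: int = 1) -> int:
    """Spatial output size of a conv or pooling window: floor((n + 2p - f) / s) + 1."""
    return (n + 2 * p - f) // s + 1

# The two settings listed above both keep the 28 x 28 spatial size.
size_1x1 = output_size(28, f=1, p=0, s=1)
size_3x3 = output_size(28, f=3, p=1, s=1)
print(size_1x1, size_3x3)
```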

Keeping in mind that the filter itself is actually f x f x 192 max-pool

So even if the max-pool filter is 1x1 we’re finding max across the 192 dimension. This filter effectively reduces to 28 x 28 x 1

rmwkwok · February 1, 2024, 12:10am

Hello @ngkhatu,

Padding is for matching the spatial dimensions, whereas the OP was asking about the channel dimension. Also, max-pooling only reduces the spatial dimensions, not the channel dimension.
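
A small pure-Python sketch can illustrate this. The function `max_pool_same` is a hypothetical helper, not a library call: it assumes a channel-last nested list (H x W x C), an odd window size, stride 1, and padding that simply ignores out-of-bounds positions. The maximum is taken inside each channel, never across channels.

```python
def max_pool_same(x, f=3):
    """Stride-1 max-pool with 'same' padding on an H x W x C nested list (f odd)."""
    h, w, c = len(x), len(x[0]), len(x[0][0])
    r = f // 2
    out = []
    for i in range(h):
        row = []
        for j in range(w):
            cell = []
            for k in range(c):  # one channel at a time
                window = [x[a][b][k]
                          for a in range(max(0, i - r), min(h, i + r + 1))
                          for b in range(max(0, j - r), min(w, j + r + 1))]
                cell.append(max(window))
            row.append(cell)
        out.append(row)
    return out

# Toy 4 x 4 x 3 input: value = i + j + 10 * channel
x = [[[i + j + 10 * k for k in range(3)] for j in range(4)] for i in range(4)]
y = max_pool_same(x)
shape = (len(y), len(y[0]), len(y[0][0]))
print(shape)
```

The output has the same three channels as the input, and with this padding the spatial size is also unchanged.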

I think @hackyon’s findings can explain the change in the channel dimension.

Cheers,
Raymond
