Why use 1x1 Conv2d of stride 2 in resnet block?

P_R_Siddharthan · March 13, 2022, 2:33pm

Hi, I am looking for the reason why a 1x1 convolution with stride 2 is the first component of the main path in the ResNet assignment.

This literally skips every other row and column of the output from the previous layer, so only one value in four is ever used. I would perform the stride 2 operation in the second component of the main path instead, because at least there we use an f x f filter.
Alternatively, it would be better if the previous block ended with a max pool of stride 2, because at least then we would be dropping the values that are less important.

Why even bother computing these numbers if they are going to be dropped?
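
To make the concern concrete, here is a minimal NumPy sketch, not the assignment's code. It assumes a single channels-last activation of shape (H, W, C_in) and treats the 1x1 convolution as a per-pixel matrix multiply with no bias; the function name `conv1x1` and all shapes are hypothetical. It checks that overwriting every position the stride skips leaves the output unchanged.

```python
import numpy as np

def conv1x1(x, w, stride=1):
    """1x1 convolution on an (H, W, C_in) array with weights of shape (C_in, C_out)."""
    # A 1x1 kernel sees one pixel at a time, so striding amounts to subsampling first.
    return x[::stride, ::stride, :] @ w

rng = np.random.default_rng(0)
x = rng.normal(size=(8, 8, 3))   # hypothetical activation from the previous layer
w = rng.normal(size=(3, 4))      # hypothetical 1x1 filter bank (3 -> 4 channels)

y = conv1x1(x, w, stride=2)

# Overwrite every position the stride never visits, then recompute.
mask = np.ones((8, 8), dtype=bool)
mask[::2, ::2] = False           # True where the strided conv never looks
x_changed = x.copy()
x_changed[mask] = 0.0
y_changed = conv1x1(x_changed, w, stride=2)

used_fraction = 1.0 - mask.mean()  # share of spatial positions that are read
print(y.shape, used_fraction, np.allclose(y, y_changed))
```

With stride 2 in both spatial dimensions, only one position in four contributes to the output, which is why the skipped activations look like wasted computation.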

paulinpaloalto · March 13, 2022, 4:35pm

It’s an excellent point that has been brought up before, but none of the previous discussions has found any explanation or justification for doing this. If the goal is to reduce the size of the output at a given layer, a pooling layer would also achieve that with less loss of information, although you’d then need to follow it with a 1 x 1 Conv layer with a stride of 1 to really get the same effect. Of course, that would be more computationally expensive. But exactly as you say, it seems strange to literally ignore every other row and column of the inputs at various layers.
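
As a rough illustration of that alternative, the NumPy sketch below compares a stride-2 1x1 convolution with a 2x2 max pool followed by a stride-1 1x1 convolution. It is only a sketch under assumptions: channels-last input of shape (H, W, C_in) with even H and W, no bias, and hypothetical helper names `max_pool_2x2` and `conv1x1`. The operation counts are simple hand tallies, not measurements.

```python
import numpy as np

def max_pool_2x2(x):
    """2x2 max pooling with stride 2 on an (H, W, C) array; H and W must be even."""
    h, w, c = x.shape
    # Split each spatial axis into (blocks, 2) and take the max inside each 2x2 block.
    return x.reshape(h // 2, 2, w // 2, 2, c).max(axis=(1, 3))

def conv1x1(x, w, stride=1):
    """1x1 convolution as a per-pixel matrix multiply, weights of shape (C_in, C_out)."""
    return x[::stride, ::stride, :] @ w

rng = np.random.default_rng(0)
x = rng.normal(size=(8, 8, 3))   # hypothetical activation
w = rng.normal(size=(3, 4))      # hypothetical 1x1 filter bank

strided = conv1x1(x, w, stride=2)               # what the assignment's block does
pooled = conv1x1(max_pool_2x2(x), w, stride=1)  # the pooling alternative

h_out, w_out, c_out = strided.shape
c_in = x.shape[-1]
multiplies = h_out * w_out * c_in * c_out       # identical for both variants
pool_comparisons = h_out * w_out * c_in * 3     # 3 comparisons per 2x2 window per channel

print(strided.shape, pooled.shape, multiplies, pool_comparisons)
```

Both variants produce the same output shape and the same number of multiplies in the 1x1 convolution. In this tally the extra cost of the pooling variant is the comparisons in the pool, and in exchange every input position can influence the output.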

I have not taken the trouble to go read any of the papers on Residual Nets. The hope would be that they might comment on this aspect, but there is no guarantee. If anyone has the time and energy to pursue that, please let us know what you find!

