I have some questions about Residual Networks Exercise

ssoin · June 28, 2021, 2:21am

in the First component of the convolutional_block() function, the kernel size is 1, but the stride depends on the param s , so i got a question: when s > 1 , will Conv2D function miss some features of the input X ? meanwhile the shotcut path Conv2D function also miss the same features. The bigger the s setting, the more features will be lost, and I don’t see anything to make up for it, so I think it’s a little bit inappropriate

paulinpaloalto · June 28, 2021, 3:42am

It is a good observation. I think you’re right: if the stride is > 1, then it will actually ignore some inputs completely. I do not have an explanation. We are just given this Residual Network architecture. The stride in the conv block function has a default value of 2 and is an optional parameter. It’s possible that in “real” applications of this they usually call this with a stride of 1, but I don’t know for sure.

Topic		Replies	Views
Course 4 week 2: Residual Networks - kernel size=1 stride = 2 Convolutional Neural Networks	5	812	November 18, 2021
DLS Course 4 Week 2 Exercise 1: 1x1 convolution with strides=2 Convolutional Neural Networks	3	596	February 20, 2024
Data loss in ResNetv50 Convolutional Neural Networks	2	515	October 26, 2021
[Data loss] Convolutional Block (1x1) with stride > 1 in ResNet50 Convolutional Neural Networks	1	547	May 14, 2022
Logic bug: convolutional_block() ignores a large fraction of its input? Convolutional Neural Networks	4	593	January 30, 2022

I have some questions about Residual Networks Exercise

Related topics