Hello, I have one question about the “shortcut” in ResNet. I can’t understand what this operation is for. I mean, what’s the point of passing information around the skipped layers? For example: a[l] = [0.5, 0.4, 0.3] is transformed by the layers and we get (for example) z[l+2] = [0.01, 0.02, 0.03]. Then, to get a[l+2], we have to take g(a[l] + z[l+2]) = g([0.51, 0.42, 0.33]).
More generally, what is the point of computing z[l+2]? And why should the resulting z[l+2] be added to the original a[l]?
Hi, @s1rGAY !
These shortcut connections serve two purposes. They make the training process easier (check this paper) and mitigate the vanishing/exploding gradient issue, since the shortcut lets the derivatives “pass through” the skipped layers without being shrunk or amplified at each layer. With this mechanism you can build deeper networks while avoiding these problems.
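To make the addition concrete, here is a minimal NumPy sketch of one residual block; the layer sizes, weights, and the choice of ReLU are made-up for illustration, not the exact course code:

```python
import numpy as np

def relu(z):
    return np.maximum(0.0, z)

def residual_block(a_l, W1, b1, W2, b2):
    """Two 'main path' layers plus the shortcut: a[l+2] = g(z[l+2] + a[l])."""
    z1 = W1 @ a_l + b1        # z[l+1]
    a1 = relu(z1)             # a[l+1]
    z2 = W2 @ a1 + b2         # z[l+2]
    return relu(z2 + a_l)     # the shortcut adds the original a[l] before the activation

# Toy numbers from the question (3-unit layers, made-up weights)
rng = np.random.default_rng(0)
a_l = np.array([0.5, 0.4, 0.3])
W1, b1 = rng.normal(size=(3, 3)) * 0.01, np.zeros(3)
W2, b2 = rng.normal(size=(3, 3)) * 0.01, np.zeros(3)
print(residual_block(a_l, W1, b1, W2, b2))
```

Because a[l] enters the output through a plain addition, the gradient with respect to a[l] contains an identity term, which is what lets the derivatives flow back through the block without vanishing.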
Thank you @alvaroramajo! Just to make sure I understand completely: when the model is used, do we skip Xidentity and just go through the model directly?
I’m not sure I understand your question, but when you “pass through” the layers with the shortcut connection, what you actually do is sum the input a[l] with the output z[l+2] of that block of layers, before applying the activation.
I mean, does the model use the shortcut only during training (the red arrows), while the trained model follows the path of the blue arrows?
Both paths are part of the model. Both are used during training (forward and back propagation) and during prediction (forward prop only), using whatever weights were learned during training.
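Here is a minimal PyTorch-style sketch of that point, assuming a simple fully connected block (the class name, layer sizes, and ReLU are made-up for illustration): the same forward pass, including the shortcut addition, runs in both training and prediction mode.

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Main path (two linear layers) plus the identity shortcut."""
    def __init__(self, dim):
        super().__init__()
        self.fc1 = nn.Linear(dim, dim)
        self.fc2 = nn.Linear(dim, dim)

    def forward(self, x):
        z = self.fc2(torch.relu(self.fc1(x)))  # z[l+2] from the main path
        return torch.relu(z + x)               # the shortcut is always added

block = ResidualBlock(3)
x = torch.tensor([[0.5, 0.4, 0.3]])

block.train()                 # training mode: same forward computation
out_train = block(x)
block.eval()                  # prediction mode: same forward computation
with torch.no_grad():
    out_eval = block(x)
print(torch.allclose(out_train, out_eval))  # True: both paths used in both modes
```

The blue and red arrows are just two parts of the same forward computation; dropping the shortcut at prediction time would change the function the network computes.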