In what way a residual block creates a shortcut[ResNet]?

Alan_Skibinski · February 5, 2023, 10:57am

Hi,

in ResNets video at 2:37 Andrew says:

"So rather than needing to follow the main path, the information from a[l] can now follow a shortcut to go much deeper into the neural network. "

But doesn’t z[l+2] need to be calculated the “normal” way in either case - with or without res. block? If yes then calling this “shortcut” - something easier and faster also confuses me. If we need to compute z[l+2] anyway then adding a[l] to the result is an additional operation, not shortcut.

I don’t want to be picky but I’m not sure if I misunderstood the wording or the concept of ResNets.

rmwkwok · February 5, 2023, 11:26am

Hello @Alan_Skibinski,

I think the term “shortcut” is about the connection - the purple line. a^{[l]} goes through both the main and the shortcut path. Relative to the main path, the shortcut path is a shortcut.

The shortcut doesn’t make the main path simpler, but it lets a copy of a^{[l]} reach there without having to get through the main path.

Raymond

Alan_Skibinski · February 5, 2023, 11:54am

Thanks. True, relative to the main path it is, but a[l] has to get to z[l+2] through the main path too and then, additionally, it gets there through the “shortcut”. If so, “rather than” from Andrew’s quote above could also mean “in addition to”. So we first calculate z[l+2] in the normal way and then add a[l]?

rmwkwok · February 5, 2023, 11:58am

The calculation is clear:-

Yes, we calculate z^{[l+2]} coming from the main path and then add a^{[l]} coming from the shortcut.

Raymond

Alan_Skibinski · February 5, 2023, 12:17pm

Great, thanks for clearing this up.

Topic		Replies	Views
Meaning of 'short cut' in ResNet Convolutional Neural Networks	5	538	August 4, 2022
ResNet confusion - Course 4 - Week 2 Convolutional Neural Networks	4	547	April 28, 2022
C4 Week 2 Clarification in RESNET Convolutional Neural Networks	4	499	February 13, 2023
RESNET Explanation Convolutional Neural Networks	1	481	August 26, 2022
Why do ResNets work? Convolutional Neural Networks	3	513	February 21, 2023

In what way a residual block creates a shortcut[ResNet]?

Related topics