In the lecture, they show that under a set of assumptions where w[l+2] and the bias b[l+2] are set to 0, a[l+2] = a[l] can occur.
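For reference, here is my reconstruction of the lecture's step (I'm assuming g is ReLU and a[l] >= 0, so g(a[l]) = a[l]):

$$
\begin{aligned}
a^{[l+2]} &= g\big(z^{[l+2]} + a^{[l]}\big) \\
          &= g\big(W^{[l+2]}\,a^{[l+1]} + b^{[l+2]} + a^{[l]}\big) \\
          &= g\big(a^{[l]}\big) && \text{if } W^{[l+2]} = 0,\ b^{[l+2]} = 0 \\
          &= a^{[l]} && \text{since } g = \mathrm{ReLU} \text{ and } a^{[l]} \ge 0
\end{aligned}
$$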
However, I am confused about why this implies an identity function and why it means ResNet is easier to train.
Here’s how I understand it: if training fails between a[l] and a[l+1], then a[l+2] can fall back on a[l] instead of relying on the faulty a[l+1]. In that case, the network can simply ignore the weights applied to a[l+1] (like a backup system).
Is my understanding correct?
And is there an underlying assumption that the identity function is difficult for a plain network to learn through backprop?
Hi @sunblockisneeded
If learning fails for the transformation between a[l] and a[l+1], the residual connection allows a[l+2] to simply copy a[l] via the identity mapping. This bypasses the faulty transformation and lets the network fall back on the earlier representation. It also makes training easier because the skip connection gives gradients a direct path back to a[l] during backpropagation.
And just like you said, there is an underlying assumption: a stack of plain layers finds it surprisingly hard to learn the identity mapping from scratch through backprop. ResNet simplifies this by providing the identity mapping explicitly through the skip connection, so the layers only have to learn the residual on top of it and can push their weights toward zero whenever the extra transformation isn't needed (see the sketch below).
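Here is a minimal PyTorch sketch to make it concrete (my own illustration, not code from the course): a fully-connected residual block where W[l+2] and b[l+2] are zeroed out, as in the lecture's assumption. The block then collapses to the identity, and the gradient still reaches a[l] through the skip connection.

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """a[l+2] = relu( W[l+2] @ relu(W[l+1] @ a[l] + b[l+1]) + b[l+2] + a[l] )"""
    def __init__(self, dim):
        super().__init__()
        self.fc1 = nn.Linear(dim, dim)  # produces z[l+1]
        self.fc2 = nn.Linear(dim, dim)  # produces z[l+2]

    def forward(self, a_l):
        a_l1 = torch.relu(self.fc1(a_l))   # a[l+1]
        z_l2 = self.fc2(a_l1) + a_l        # skip connection adds a[l]
        return torch.relu(z_l2)            # a[l+2]

block = ResidualBlock(4)
nn.init.zeros_(block.fc2.weight)  # the lecture's assumption: W[l+2] = 0
nn.init.zeros_(block.fc2.bias)    # and b[l+2] = 0

a_l = torch.rand(1, 4, requires_grad=True)  # a[l] >= 0, like a ReLU output
a_l2 = block(a_l)

print(torch.allclose(a_l2, a_l))  # True: the block collapses to the identity
a_l2.sum().backward()
print(a_l.grad)                   # all ones: the gradient flows straight through the skip
```

If you drop the `+ a_l` to get a plain two-layer block, the same zero weights would give an output of zero and a zero gradient with respect to a[l], which is exactly the situation the skip connection protects against.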
Hope it helps! Feel free to ask if you need further assistance.
Thanks for your answer. Your explanation helped me build a more robust neural network in my brain.
You’re welcome! Happy to help.