Resnet identity mapping

Jaime_Gonzalez · May 4, 2022, 9:35am

I believe I understand your confusion. Take a look at this image of a residual block:

When we state that “the identity function is easy for a residual block to learn” what we mean is that, due to the skip connection, if F(x) in the image approaches 0 (due to, for example, L2 regularization), the ResNet block will not return 0 (the activation of the ResNet block will not be 0), it will be X, the original tensor that entered the Residual block.

To be clear, ‘the identity’ or the ‘identity function’ is “a transformation that leaves an object unchanged”, therefore, in the image, the arrow that says ‘identity’ means we carry the X tensor forward to the summation as it is, without altering it, as the identity function does nothing to it.

Saying ‘apply the identity transformation/function to the tensor’ instead of ‘do nothing to the tensor’ is an idea brought to deep learning from matrix operations in mathematics, where the identity transformation is the transformation you apply to a matrix X and get X as a result.

For a more in depth explanation, rewatch Andrew Ng’s explanation here, it is very good: Coursera | Online Courses & Credentials From Top Educators. Join for Free | Coursera

Otherwise read this, which may also help: on the topic read this:

Topic		Replies	Views
C4W2: About what "Residual block is easy to learn identity function" means Convolutional Neural Networks	1	337	October 7, 2023
I'm confused why RestNet works Convolutional Neural Networks week-2	3	25	January 24, 2025
Why learning identity function will give RN better performance? Convolutional Neural Networks	2	521	November 4, 2022
Week 2, ResNets(Identity Function) Convolutional Neural Networks	7	560	July 15, 2022
Question seems to have swapped answer with another Convolutional Neural Networks	2	1028	March 12, 2023

Resnet identity mapping

Related topics