W2 Test Question 5

jonaslalin · September 28, 2021, 8:13am

From the original ResNet Deep Residual Learning for Image Recognition:

This reformulation is motivated by the counterintuitive phenomena about the degradation problem (Fig. 1, left). As we discussed in the introduction, if the added layers can be constructed as identity mappings, a deeper model should have training error no greater than its shallower counterpart. The degradation problem suggests that the solvers might have difficulties in approximating identity mappings by multiple nonlinear layers. With the residual learning reformulation, if identity mappings are optimal, the solvers may simply drive the weights of the multiple nonlinear layers toward zero to approach identity mappings.
In real cases, it is unlikely that identity mappings are optimal, but our reformulation may help to precondition the problem. If the optimal function is closer to an identity mapping than to a zero mapping, it should be easier for the solver to find the perturbations with reference to an identity mapping, than to learn the function as a new one. We show by experiments (Fig. 7) that the learned residual functions in general have small responses, suggesting that identity mappings provide reasonable preconditioning.

Hence,

Topic		Replies	Views
Skip connections in ResNets Convolutional Neural Networks coursera-platform	2	605	October 3, 2021
Why learning identity function will give RN better performance? Convolutional Neural Networks coursera-platform	2	589	November 4, 2022
Resnet Lecture Clarification Convolutional Neural Networks coursera-platform	2	546	October 31, 2021
Motivation for resnets Convolutional Neural Networks coursera-platform	1	553	October 14, 2021
I'm confused why RestNet works Convolutional Neural Networks week-module-2 , coursera-platform	3	71	January 24, 2025

W2 Test Question 5

Related topics