Hi!
I’m wondering what the point of the residual layers is. If a[l] is equal to a[l+2], then what is the purpose of those layers? Why not just connect the output of layer l directly to the input of layer l+2?
Thank you!
I am sure there are plenty of posts about residual connections in this part of the forum if you search. The basic idea is this:

Very deep networks suffer from vanishing/exploding gradients, so it helps to carry a copy of an earlier activation forward and feed it directly into a later layer, like a shortcut around the layers in between. That is what the skip connection does: the activation a[l] is added to the pre-activation of layer l+2, so a[l+2] = g(z[l+2] + a[l]). Note that a[l+2] is not forced to equal a[l]; the identity mapping is simply easy for the block to learn if the extra layers turn out not to help, while those layers can still learn something useful on top of it. This is what lets a very deep network keep propagating learning (and gradients) all the way through to the output.
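Here is a minimal sketch of such a residual block in Keras, assuming two fully connected layers of matching width (the function name `residual_block` and the `units` parameter are just illustrative, not anything from the course assignments):

```python
import tensorflow as tf
from tensorflow.keras import layers

def residual_block(x, units):
    # Hypothetical two-layer residual block:
    # the shortcut carries a[l] forward and is added to z[l+2]
    # before the final activation, i.e. a[l+2] = g(z[l+2] + a[l]).
    shortcut = x                                    # a[l]
    x = layers.Dense(units, activation="relu")(x)   # layer l+1
    x = layers.Dense(units)(x)                      # layer l+2 (z[l+2], no activation yet)
    x = layers.Add()([x, shortcut])                 # skip connection: z[l+2] + a[l]
    return layers.Activation("relu")(x)             # a[l+2]

# Example usage in a functional model
inputs = tf.keras.Input(shape=(64,))
outputs = residual_block(inputs, units=64)
model = tf.keras.Model(inputs, outputs)
```

The point of the Add layer is that during backpropagation the gradient flows through the shortcut unchanged, so even if the two Dense layers squash the gradient, the earlier layers still receive a useful signal.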
Thank you, and sorry for not searching more carefully!