In the Understanding Residual networks video, the instructor said:
"Why should you have three blocks of code here to go through the data when you could instead have a loop that runs the data through residual type 2 three times? Also, it could be the same weights in each block, so instead of each of the three blocks learning independent weights separately, you get one block that is learned and executed three times."
I did not completely understand why some blocks could have the same weights. In which situations can the identity block have the same weights?
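If I understand the instructor correctly, weight sharing here just means calling the same layer instance repeatedly. Here is a minimal sketch of that idea in Keras (my own illustration, not the course's code; `ResidualBlock`, `SharedResNet`, and the single Dense layer inside the block are assumptions):

```python
import tensorflow as tf

class ResidualBlock(tf.keras.layers.Layer):
    """One identity-shortcut block: output = x + f(x)."""
    def __init__(self, units):
        super().__init__()
        self.dense = tf.keras.layers.Dense(units, activation="relu")

    def call(self, x):
        # Assumes the input feature dimension equals `units`,
        # so the skip addition is shape-compatible.
        return x + self.dense(x)

class SharedResNet(tf.keras.Model):
    """ONE block instance, executed `repeats` times with the same weights."""
    def __init__(self, units=64, repeats=3):
        super().__init__()
        self.block = ResidualBlock(units)
        self.repeats = repeats

    def call(self, x):
        for _ in range(self.repeats):
            x = self.block(x)  # same weights reused on every pass
        return x
```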
Even if they have the same weights, it does not mean they extract the same information, because they are executed in sequence and so have an additive effect on each other. Maybe this is also less complex for the network, and in any case there are so many parameters that sharing some of them does not reduce the network's effectiveness. When dropout is used, for example, some weights are not even taken into consideration.
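A small check of that intuition, reusing the hypothetical `ResidualBlock` and `SharedResNet` from the sketch above: three independent blocks carry roughly three times the parameters of one shared block, yet the shared block still applies three sequential transformations.

```python
class UnsharedResNet(tf.keras.Model):
    """Three independent blocks, each learning its own weights."""
    def __init__(self, units=64, repeats=3):
        super().__init__()
        self.blocks = [ResidualBlock(units) for _ in range(repeats)]

    def call(self, x):
        for block in self.blocks:
            x = block(x)
        return x

x = tf.zeros((1, 64))              # dummy input just to build the weights
shared, unshared = SharedResNet(), UnsharedResNet()
shared(x), unshared(x)

print(shared.count_params())       # parameters of one block
print(unshared.count_params())     # roughly three times as many
```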