Hi everyone. I have a question about the Residual Network. I don’t understand the rationale given for turning 3 repeated residual blocks into a loop. The reason given in the video is the “Don’t repeat yourself” programming principle, but I don’t see how the two are related: 3 separately written residual blocks would give more trainable weights than passing the data through the same block 3 times.


I’d appreciate any clarification. Thank you.
The point here is to avoid writing the same code 3 times; it has nothing to do with trainable weights. The number of weights is the same as long as the number of blocks is the same.
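Here is a minimal sketch of what I mean (assuming Keras; `residual_block` is a hypothetical helper standing in for the course's block). Each iteration of the loop calls the helper again, which creates brand-new layers with their own fresh weights, so the result is identical to writing the block out 3 times:

```python
import tensorflow as tf
from tensorflow.keras import layers

def residual_block(x, filters):
    """One simple residual block: two convs plus a skip connection."""
    shortcut = x
    x = layers.Conv2D(filters, 3, padding="same", activation="relu")(x)
    x = layers.Conv2D(filters, 3, padding="same")(x)
    x = layers.Add()([shortcut, x])  # skip connection
    return layers.Activation("relu")(x)

inputs = tf.keras.Input(shape=(32, 32, 64))
x = inputs
for _ in range(3):           # 3 distinct blocks, 3 distinct sets of weights
    x = residual_block(x, 64)

model = tf.keras.Model(inputs, x)
model.summary()              # same trainable-parameter count as writing the block 3 times
```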
Yea. However, by turning 3 blocks into 1 block with a loop, you fundamentally change the architecture of the model, not just the programming style. Yet the rationale given in the video for this modification is just to avoid repeating code.
If the purpose is not to repeat code, wouldn’t the correct way be to take advantage of Python variables and functions, as we did in the graded assignment?
3 separate blocks vs. 1 loop that builds a block 3 times is the same as having 3 blocks! Each pass through the loop constructs a new block with its own weights; nothing is shared, so the architecture does not change at all.
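To see the difference you were worried about: weight sharing only happens if you reuse the *same* layer objects inside the loop, which is not what the video does. A hedged sketch, again assuming Keras (the layer names here are illustrative, not from the course):

```python
import tensorflow as tf
from tensorflow.keras import layers

inputs = tf.keras.Input(shape=(32, 32, 64))

# Case 1: a fresh layer each iteration -> 3 separate weight sets
# (this is what the loop in the video does)
x = inputs
for _ in range(3):
    x = layers.Conv2D(64, 3, padding="same")(x)   # new Conv2D object every pass

# Case 2: one layer reused 3 times -> 1 shared weight set
# (NOT what the video does)
shared = layers.Conv2D(64, 3, padding="same")
y = inputs
for _ in range(3):
    y = shared(y)                                 # same object, same weights, applied 3 times

print(tf.keras.Model(inputs, x).count_params())   # roughly 3x the parameters of...
print(tf.keras.Model(inputs, y).count_params())   # ...this shared version
```

Since the loop in the video follows Case 1, the trainable-weight count is exactly the same as writing the 3 blocks out by hand.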