C4_W2 ResNet50 implementation model summary error

Hardik_Bhortakke · June 12, 2024, 5:42am

Hi, I am facing a test failed error in programming assignment of week 2 of Convolutional neural network course in which the Model summary of my model isi shown as
[‘TFOpLambda’, (None, 15, 15, 256), 0]
while the expected is :
[‘Add’, (None, 15, 15, 256), 0]
how to rectify this error as the grader shows incorrect implementation of the ResNet50 model

Link to the classroom item I am referring to : Coursera | Online Courses & Credentials From Top Educators. Join for Free | Coursera

TMosh · June 12, 2024, 5:44am

Did you use a “lambda” function in the notebook?
If so, don’t do that.

Hardik_Bhortakke · June 12, 2024, 5:48am

There is a cell marked as “you can not edit this cell” which uses initilizer as lambda in identity block

Hardik_Bhortakke · June 12, 2024, 5:50am

But that’s in the block testing cell and there isn’t any use of lambda function elsewhere

TMosh · June 12, 2024, 6:02am

Can you post an image of the entire model summary?

Hardik_Bhortakke · June 12, 2024, 9:13am

Got it. After looking at the complete summary i realized that in the identity block instead of using the Add() function, i had simply used the ‘+’ operator. Thanks for the help .

paulinpaloalto · June 12, 2024, 2:51pm

Yes, the two operations are equivalent, but the way the tests work here is that they do a literal compare of the “summary” output, so you have to use the exact function that they tell you to use in the instructions. Just being logically correct isn’t good enough.

But they did their best to help you out here by being very explicit in the instructions about how to implement that logic.

Hardik_Bhortakke · June 12, 2024, 5:49pm

But one thing that’s bugging me is upon using ‘+’ operator the accuracy of the model upon training is 0.7666666507720947 while using Add() function gives accuracy of 0.7083333134651184. I’ve tried it more than once but same results.

TMosh · June 12, 2024, 5:49pm

When doing this sort of testing, be sure you “restart the kernel and clear all the output” every time you start a test.

paulinpaloalto · June 12, 2024, 7:55pm

Also note that even if you use Tom’s method to always start from the same state, the results still are not deterministic even if you don’t change the code. I ran it three times from the “reset” state with the Add() implementation and got three different values for the Test Accuracy:

0.85
0.866666
0.766666

Even when you set the random seeds for the PRNG algorithms, the results are still not deterministic, because the training is parallelized across multiple CPUs and GPUs. Parallelism is inherently non-deterministic, since exactly how the threads get scheduled depends on everything ele that’s happening on the computer at the same time. There are ways to artificially constrain that to be deterministic, but then you lose most of the advantages of parallelization and it really costs you in terms of performance. Here’s a thread from mentor Raymond which discusses this point in a lot more detail.

Topic		Replies	Views
ResNet Assignment Week 2 Convolutional Neural Networks coursera-platform	6	534	March 1, 2023
Help! Residual conv network Programming Assignment Convolutional Neural Networks coursera-platform	15	645	December 2, 2023
Week 2 Lab 1 W2A1 Residual_Networks.ipynb Convolutional Neural Networks coursera-platform	2	559	March 28, 2022
Week 2 Assignment 1, Exercise 3--Resnet50 error Convolutional Neural Networks coursera-platform	7	617	July 7, 2021
Trouble Implementing Convolutional Blocks and ResNet50 Model for Week 2 Programming Assignment Convolutional Neural Networks week-2 , coursera-platform	7	257	February 17, 2024

C4_W2 ResNet50 implementation model summary error

Related topics