Attention assignment, modelf summary mismatch

Hi,
I am getting the following error when submitting my Week 4 Assignment 1 for grading. Please let me know if this is a known issue with the grader, as the code passes the notebook tests without raising any errors:

Code Cell UNQ_C1: Function 'one_step_attention' is correct.
cell_UNQ_C2 function modelf summary does not match the expected model 
If you see many functions being marked as incorrect, try to trace back your steps & identify if there is an incorrect function that is being used in other steps.
This dependency may be the cause of the errors.

The issue seems to come from the fact that the model's layers, as displayed by model.summary(), are not in the same order as they are specified in the modelf function. The grader is probably assuming that the order is respected, hence the error. @DLS_Mentors, please advise on possible solutions, as I am facing strict deadlines. Thank you so much for your help.

The deadlines here are fake, so don’t stress about those.

Funny, I just ended up seeing this same problem in Course 4 Week 2 in the Residual Net exercise. It turns out that the model summary gets constructed from the computational graph, so order can matter when you have "skip" layers followed by an "Add" layer. Since addition is commutative, you'd expect the order of the operands not to matter, but it does matter for the order of the computation graph. Try looking for that situation in your code and experimenting with the order of the operands.
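If it helps to see what I mean in isolation, here is a minimal, self-contained sketch (a toy example of my own, not the assignment code, with made-up layer names) that you can use to experiment with how the operand order of Add() is recorded in the graph that model.summary() is built from:

from tensorflow.keras.layers import Input, Dense, Add
from tensorflow.keras.models import Model

x_in = Input(shape=(8,), name="x_in")
main = Dense(8, name="main_path")(x_in)   # "main" branch
skip = Dense(8, name="skip_path")(x_in)   # "skip" branch

# Numerically Add([main, skip]) == Add([skip, main]), but the functional API
# records the operands in the order you pass them, and that graph is what
# summary() (and a summary-based comparator) walks.
out = Add(name="merge")([main, skip])
Model(inputs=x_in, outputs=out, name="demo").summary()

Swap the two operands of Add, re-run, and diff the two summaries to see whether the order changes anything in your TF/Keras version.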

Just for reference, here’s the other thread that discusses this type of issue in C4.

Hey @Sacha_B, please refrain from using phrases like "issue with grader" in topic headers, as they suggest something is wrong with the grader when, in fact, it is functioning properly. It can also give other learners the wrong idea.
I have removed that wording and renamed your post to "Attention assignment, modelf summary mismatch".
Thank you.

Thanks for your answers. I am probably making a silly mistake, since I am the only one reporting this issue, but for some reason the following code:
input_x = Input(shape=(Tx, human_vocab_size), name='inputs')
s0 = Input(shape=(n_s,), name='s0')
c0 = Input(shape=(n_s,), name='c0')
s = s0
c = c0

creates the s0 layer before the input_x layer, and interchanging the order, as proposed by @paulinpaloalto, does not change the model's output:
[['InputLayer', [(None, 64)], 0], ['InputLayer', [(None, 30, 37)], 0], ['RepeatVector', (None, 30, 64), 0, 30], ['Bidirectional', (None, 30, 64), 17920], ['Concatenate', (None, 30, 128), 0], ['Dense', (None, 30, 10), 1290, 'tanh'], ['Dense', (None, 30, 1), 11, 'relu'], ['Activation', (None, 30, 1), 0], ['Dot', (None, 1, 64), 0], ['InputLayer', [(None, 64)], 0], ['LSTM', [(None, 64), (None, 64), (None, 64)], 33024, [(None, 1, 64), (None, 64), (None, 64)], 'tanh'], ['Dense', (None, 11), 715, 'softmax']]

Of course, I could rearrange the expected summary as a temporary workaround, but since the grader is looking for exactly the right order, the grading still eventually fails, hence the use of "issue with grader", @Mubsi.
Thank you for your help.

I must also add that the rest of the code runs without error, so I don't suspect the problem is coming from my implementation. Shouldn't the grader account for the fact that layers end up in the computational graph in an (almost) arbitrary order and accept the code in this case, @Mubsi? Thank you.

@Sacha_B I got the same error, and it turned out that in my implementation of one_step_attention I had looked at the image and passed a and s_prev in the wrong order.
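For anyone else who lands here, a minimal, self-contained sketch of just that step (the shapes and variable names are illustrative, not the notebook's exact definitions): the figure concatenates the Bi-LSTM activations a first and the repeated s_prev second, and swapping them can change the order in which the upstream layers show up in the summary even though the output shape is identical.

from tensorflow.keras.layers import Input, RepeatVector, Concatenate

Tx, n_a, n_s = 30, 32, 64
a = Input(shape=(Tx, 2 * n_a))     # Bi-LSTM activations, shape (m, Tx, 2*n_a)
s_prev = Input(shape=(n_s,))       # previous post-attention state, shape (m, n_s)

s_rep = RepeatVector(Tx)(s_prev)   # (m, Tx, n_s)
# a first, then the repeated s_prev, matching the figure:
concat = Concatenate(axis=-1)([a, s_rep])   # (m, Tx, 2*n_a + n_s)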