C4W2 Assignment: decoder not passing tests

MatejPolak · June 21, 2024, 12:18pm

Hi, I’m having problems completing the Week 2 assignment.

Exercise 1 (attention mechanism) and 2 (decoder layer) are both passing all the tests. But I cannot get exercise 3 (decoder) to pass the tests:

Failed test case: Wrong values in x.
Expected: [1.6461557, -0.7657816, -0.04255769, -0.8378165]
Got: [ 1.5914114 -0.4431648 -0.0210509 -1.1271956]

Failed test case: Wrong values in outd when training=True.
Expected: [1.6286429, -0.7686589, 0.00983591, -0.86982]
Got: [ 1.5467136  -1.175141    0.09500299 -0.46657562]

Failed test case: Wrong values in outd when training=True and use padding mask.
Expected: [1.390952, 0.2794097, -0.2910638, -1.3792979]
Got: [ 1.5235926   0.24524598 -0.7199153  -1.0489233 ]

If anyone would be so kind to have a look at my solution, I’ll happily send it privately.
Thanks!

Alireza_Saei · June 21, 2024, 3:35pm

Hi @MatejPolak

Try to find the problem in your code by yourself (e.g. debugging, printing intermediate values, etc.). However, if you feel stuck, I can take a look at your code!

Deepti_Prasad · June 21, 2024, 5:31pm

Hi @MatejPolak

for a decode test to fail your output shows, you have incorrect x and also wrong values with training=true and use of padding mask.

Debugging task would be

to check first if you applied the same layer recall you used in attention mechanism. Also check the instructions section which mentions about the padding value.

next in case you have followed the above steps, then it means you need to go back in previous grader cell to check if you recalled each layer according to the instructions given.

in case you are able to find, kindly send screenshot of codes to any of the mentors who like them to check codes.

Your output gives details about where you might be going wrong, so re-read instructions again and again. Use search tool here, for learners who were stuck in the same grade cells. I am sure you will be able to debug!!!

Regards
DP

MatejPolak · June 24, 2024, 2:35pm

Hi,

thanks to @Alireza_Saei 's kind review of my code, I found that the problem was that I was not using the argument “training” correctly in exercise 2 (!). I used this:

mult_attn_out1, attn_weights_block1 = self.mha1(..., training=True, ...)

instead of this:

mult_attn_out1, attn_weights_block1 = self.mha1(..., training=training, ...)

The exercise 2 tests did not catch this error.

Thanks for the help!

Cheers,
Matej

Topic		Replies	Views
C4CW2 Failing Test Cases for Exercise 3 - Decoder NLP with Attention Models week-2	2	418	January 8, 2024
C4W2_Assignment - Ex 7 Decoder Layer output NLP with Attention Models week-2	12	374	April 4, 2024
C4W2_Assignment Transformer Summarizer Exercise 3 Decoder Failed test cases NLP with Attention Models week-2	14	70	October 21, 2024
C4W2 Exercise 2 - sample test is correct but unit test cases are correct function is failing NLP with Attention Models week-2	5	46	October 20, 2024
C4W2 Assignment DecoderLayer NLP with Attention Models week-2	7	538	April 19, 2024

C4W2 Assignment: decoder not passing tests

Related topics