Wrong value error in UNQ_C6

Hello.
The unit test for the DecoderLayer fails with:
AssertionError Traceback (most recent call last)
~\AppData\Local\Temp\ipykernel_37688\3686812772.py in
1 # UNIT TEST
----> 2 DecoderLayer_test(DecoderLayer, create_look_ahead_mask)

c:\gilad\my courses\coursera\Reccurent Neural Networks\W4A1\public_tests.py in DecoderLayer_test(target, create_look_ahead_mask)
178 assert tuple(tf.shape(out).numpy()) == q.shape, f"Wrong shape. We expected {q.shape}"
179
--> 180 assert np.allclose(attn_w_b1[0, 0, 1], [0.5271505, 0.47284946, 0.], atol=1e-2), "Wrong values in attn_w_b1. Check the call to self.mha1"
    181 assert np.allclose(attn_w_b2[0, 0, 1], [0.32048798, 0.390301, 0.28921106]), "Wrong values in attn_w_b2. Check the call to self.mha2"
    182 assert np.allclose(out[0, 0], [-0.22109576, -1.5455486, 0.852692, 0.9139523]), "Wrong values in out"

AssertionError: Wrong values in attn_w_b1. Check the call to self.mha1

I double-checked my code but cannot find the problem.
Please help. Thanks, Gilad


Please send me your code for this function in a private message: click my name, then Message.


Please also mention the assignment name in your title or explanation.

You need to check how you have called self.mha1, since that call is what produces the values for attn_w_b1.

Read these instructions from the notebook:

  1. Block 1 is a multi-head attention layer with a residual connection, and look-ahead mask. Like in the EncoderLayer, Dropout is defined within the multi-head attention layer.
    The first two blocks are fairly similar to the EncoderLayer except you will return attention_scores when computing self-attention

So make sure that call passes return_attention_scores=True and the look_ahead_mask.

Another common mistake in this assignment is passing the training argument in that call, which is not required for attn_w_b1.
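
For reference, here is a minimal, standalone sketch of how tf.keras.layers.MultiHeadAttention behaves with return_attention_scores=True and a look-ahead mask. The layer, shapes, and variable names are only illustrative, not the graded code:

```python
import tensorflow as tf

# Illustrative only: a standalone MultiHeadAttention call, not the assignment's DecoderLayer.
mha = tf.keras.layers.MultiHeadAttention(num_heads=2, key_dim=4)

x = tf.random.uniform((1, 3, 4))  # (batch, target_seq_len, embedding_dim)

# Lower-triangular mask: 1 = position may be attended to, 0 = masked out.
look_ahead_mask = tf.linalg.band_part(tf.ones((1, 3, 3)), -1, 0)

# With return_attention_scores=True the layer returns a tuple:
# (attention_output, attention_scores). No training argument is passed here,
# matching the point above that it is not needed for attn_w_b1.
attn_out, attn_scores = mha(
    query=x, value=x, key=x,
    attention_mask=look_ahead_mask,
    return_attention_scores=True,
)

print(attn_out.shape)     # (1, 3, 4)
print(attn_scores.shape)  # (1, 2, 3, 3) -> (batch, num_heads, seq_q, seq_k)
```

If the tuple unpacking, the mask, or return_attention_scores is missing, the weights stored in attn_w_b1 will not match what the unit test expects.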

Regards
DP

Thank you for sending me your code, @gilad.danini!
Your code for BLOCK 2 is incorrect. Look at these instructions again:

# calculate self-attention using the Q from the first block and K and V from the encoder output. 

Here, what is the encoder output? It’s not x. Also, for Block 2, do we need look_ahead_mask or padding_mask?
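
As a hedged illustration of that difference (using the assignment's usual names enc_output and padding_mask, but not the graded code), a cross-attention call looks roughly like this, with Q from the first block and K and V from the encoder output:

```python
import tensorflow as tf

# Illustrative only: cross-attention in the spirit of Block 2.
mha2 = tf.keras.layers.MultiHeadAttention(num_heads=2, key_dim=4)

q_from_block1 = tf.random.uniform((1, 3, 4))  # output of Block 1 (after layernorm)
enc_output = tf.random.uniform((1, 5, 4))     # encoder output; note the different sequence length
padding_mask = tf.ones((1, 1, 5))             # 1 = real token, 0 = padding

attn2, attn_scores2 = mha2(
    query=q_from_block1,          # Q comes from the first block, not from the encoder
    value=enc_output,             # V comes from the encoder output
    key=enc_output,               # K comes from the encoder output
    attention_mask=padding_mask,  # padding mask here, not the look-ahead mask
    return_attention_scores=True,
)

print(attn_scores2.shape)  # (1, 2, 3, 5) -> decoder positions attending over encoder positions
```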


Got it, thanks.
So we apply the look-ahead mask only to the self-attention over the decoder's own input?


Yes, but only for attn_w_b1 (Block 1).

I hope you are not mixing up Block 1 and Block 2. Re-read the instructions point by point and check that you have followed each one.

OK, thanks. I am on the final Transformer implementation now.
