C5_W4_A1 Wrong values in attn_w_b2. Check the call to self.mha2

I am building out the decoder for UNQ_C6 and receive this error message:
AssertionError: Wrong values in attn_w_b2. Check the call to self.mha2

Right now I’m using the normalized sum of the attention output and the input as Q. For K and V I’m using enc_output, and I’m passing padding_mask as specified in the function definitions.
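In code terms, my block 2 call boils down to roughly this (a minimal standalone sketch with placeholder shapes and stand-in names, not the notebook’s exact code):

```python
import tensorflow as tf

# Stand-ins for the attributes and tensors inside DecoderLayer.call.
mha2 = tf.keras.layers.MultiHeadAttention(num_heads=8, key_dim=64)
Q1 = tf.random.uniform((1, 10, 512))          # normalized output of block 1
enc_output = tf.random.uniform((1, 12, 512))  # encoder output
padding_mask = tf.ones((1, 10, 12))           # 1 = attend, 0 = masked out

mult_attn_out2, attn_weights_block2 = mha2(
    Q1, enc_output, enc_output,               # query, value, key
    attention_mask=padding_mask,
    return_attention_scores=True,
)
```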

I’ve read other posts about this but decided to create a new one since they are all several years old now. I tried using x instead of enc_output, but that goes against my understanding of what we’re doing with the transformer. I also wondered whether passing attn_weights_block1 is the right path, but I get the same error when I use them.

Are you using the return_attention_scores parameter in self.mha2, at all?

Yes, I have set it to True. Whether it’s True or False, I get this error.
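For reference, my understanding of that parameter (a minimal standalone sketch, assuming tf.keras.layers.MultiHeadAttention):

```python
import tensorflow as tf

mha = tf.keras.layers.MultiHeadAttention(num_heads=2, key_dim=32)
q = tf.random.uniform((1, 4, 64))

out = mha(q, q, q)                                   # default: output only
out, w = mha(q, q, q, return_attention_scores=True)  # output plus weights
```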

Perhaps the error is not with the mha2() call. That’s just a blanket suggestion from the notebook test case - it often is not the actual cause.

Interesting. Assuming the error is not in the self.mha2() call, I tried breaking other parts of the code in this cell before the call to multi-head attention block 2. I’m also assuming the previous cells are correct, given that they have all passed their tests. This would mean the error may be in defining Q1, mult_attn_out1, or attn_weights_block1.

Block 1 mha: I’m passing the input x three times to perform self-attention, plus a look-ahead mask, and returning the attention weights. If I use enc_output instead of x I get an error at self.mha1. I wondered whether I also need to pass the training flag to enable dropout, but my attempts at that have also failed.
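Here’s a standalone sketch of what my block 1 call boils down to (placeholder shapes and stand-in names, not the notebook’s exact code; the training flag would only matter if the layer was built with a nonzero dropout rate):

```python
import tensorflow as tf

mha1 = tf.keras.layers.MultiHeadAttention(num_heads=8, key_dim=64)
x = tf.random.uniform((1, 10, 512))  # decoder input

# Lower-triangular look-ahead mask: position t may attend to positions <= t.
look_ahead_mask = tf.linalg.band_part(tf.ones((1, 10, 10)), -1, 0)

mult_attn_out1, attn_weights_block1 = mha1(
    x, x, x,                          # self-attention: query = value = key = x
    attention_mask=look_ahead_mask,
    return_attention_scores=True,
)
```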

Q1: This is pretty straightforward; I’m adding mult_attn_out1 and enc_output.


Update: I now realize I had to sum mult_attn_out1 and x instead of enc_output. This solved my issue.
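For anyone who lands here later: the skip connection after block 1 adds the block’s own input x; enc_output only enters in block 2 as K and V. A minimal sketch of the fix (standalone, with placeholder shapes; layernorm1 stands in for the notebook’s layer normalization attribute):

```python
import tensorflow as tf

layernorm1 = tf.keras.layers.LayerNormalization(epsilon=1e-6)
x = tf.random.uniform((1, 10, 512))               # decoder input
mult_attn_out1 = tf.random.uniform((1, 10, 512))  # block 1 attention output

Q1 = layernorm1(mult_attn_out1 + x)               # residual uses x
# Q1 = layernorm1(mult_attn_out1 + enc_output)    # wrong: caused the attn_w_b2 error
```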
