Hello. I have managed to implement the calculation of the attention output (mult_attn_out2) of the second MHA block successfully, or at least it passes its unit tests, but the values in the output of the feed-forward network are wrong according to the final unit test. So something is going wrong in the add+norm layer after the second MHA block, the FFN layer, the dropout layer, the final add+norm layer, or some combination of these, but I cannot find the problem. My code is attached below; hopefully someone can see what I cannot.
DecoderLayer call method code
mentor edit: code removed
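[Editor's note: since the original code had to be removed, below is a minimal, generic sketch of the standard Transformer decoder-layer call flow that the failing assertion exercises (cross-attention, add+norm, FFN, dropout, final add+norm). It follows the architecture from "Attention Is All You Need" and the TensorFlow tutorial; the layer names (mha1, mha2, ffn, layernorm1-3, dropout_ffn) and signatures are assumptions and may differ from the assignment's starter code. It is a reference sketch, not the poster's removed implementation.]

```python
import tensorflow as tf

class DecoderLayerSketch(tf.keras.layers.Layer):
    """Hypothetical decoder layer illustrating the expected data flow."""

    def __init__(self, embedding_dim, num_heads, fully_connected_dim, dropout_rate=0.1):
        super().__init__()
        self.mha1 = tf.keras.layers.MultiHeadAttention(num_heads, embedding_dim)
        self.mha2 = tf.keras.layers.MultiHeadAttention(num_heads, embedding_dim)
        self.ffn = tf.keras.Sequential([
            tf.keras.layers.Dense(fully_connected_dim, activation="relu"),
            tf.keras.layers.Dense(embedding_dim),
        ])
        self.layernorm1 = tf.keras.layers.LayerNormalization(epsilon=1e-6)
        self.layernorm2 = tf.keras.layers.LayerNormalization(epsilon=1e-6)
        self.layernorm3 = tf.keras.layers.LayerNormalization(epsilon=1e-6)
        self.dropout_ffn = tf.keras.layers.Dropout(dropout_rate)

    def call(self, x, enc_output, training, look_ahead_mask, padding_mask):
        # Block 1: masked self-attention over the decoder input, then add+norm.
        attn1, attn_w_b1 = self.mha1(
            x, x, x, attention_mask=look_ahead_mask, return_attention_scores=True)
        Q1 = self.layernorm1(attn1 + x)

        # Block 2: cross-attention over the encoder output, then add+norm.
        # Note the skip connection uses Q1 (the first add+norm output), not x.
        attn2, attn_w_b2 = self.mha2(
            Q1, enc_output, enc_output,
            attention_mask=padding_mask, return_attention_scores=True)
        Q2 = self.layernorm2(attn2 + Q1)

        # Feed-forward network applied to Q2 (the normalized sum), then dropout,
        # which is only active when training=True.
        ffn_output = self.ffn(Q2)
        ffn_output = self.dropout_ffn(ffn_output, training=training)

        # Final add+norm: the skip connection comes from Q2, the FFN's input.
        out3 = self.layernorm3(ffn_output + Q2)
        return out3, attn_w_b1, attn_w_b2
```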
Error Message
AssertionError                            Traceback (most recent call last)
<ipython-input-...> in <module>
      1 # UNIT TEST
----> 2 DecoderLayer_test(DecoderLayer, create_look_ahead_mask)

~/work/W4A1/public_tests.py in DecoderLayer_test(target, create_look_ahead_mask)
    180     assert np.allclose(attn_w_b1[0, 0, 1], [0.5271505, 0.47284946, 0.], atol=1e-2), "Wrong values in attn_w_b1. Check the call to self.mha1"
    181     assert np.allclose(attn_w_b2[0, 0, 1], [0.32048798, 0.390301, 0.28921106]), "Wrong values in attn_w_b2. Check the call to self.mha2"
--> 182     assert np.allclose(out[0, 0], [-0.22109576, -1.5455486, 0.852692, 0.9139523]), "Wrong values in out"
    183
    184

AssertionError: Wrong values in out