C5_W4_A1_Transformer_Subclass_v1_Encoder

Hello,
I’m currently a bit lost trying to use the MultiHeadAttention layer.
My code currently looks like this:

# START CODE HERE
# mentor edit - code removed
# END CODE HERE

But I’m getting the error:
AssertionError: Wrong values when training=True

I tried reading the hints, but I don’t know where to get the values Q, V, and K to pass to the mha. I thought the mha was supposed to calculate these internally.

For self-attention, you use ‘x’ three times (once each for the Q, V, and K arguments), and you pass the “mask” parameter. Do not use the training argument there.

In encoder_layer_out, use “out1”, not “attn_output”.
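To illustrate why the same input is passed three times: in self-attention, the queries, values, and keys are all projections of the same tensor x, so a call like `self.mha(x, x, x, mask)` (as in this assignment's EncoderLayer, built on Keras' `MultiHeadAttention`) lets the layer compute its own Q, V, and K internally. Below is a minimal single-head NumPy sketch of the same idea — a hedged illustration, not the assignment's solution; the function name and shapes are assumptions for the example.

```python
import numpy as np

def self_attention(x, mask=None):
    # Self-attention: the same input x serves as query, value, and key.
    q, v, k = x, x, x
    d_k = q.shape[-1]
    # Scaled dot-product scores, shape (seq_len, seq_len)
    scores = q @ k.swapaxes(-1, -2) / np.sqrt(d_k)
    if mask is not None:
        # Mask out padded positions with a large negative value before softmax
        scores = np.where(mask == 0, -1e9, scores)
    # Softmax over the key dimension
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v

x = np.random.default_rng(0).normal(size=(4, 8))  # (seq_len, d_model)
out = self_attention(x)                            # same shape as x
```

Note that the layer already applies dropout internally when training; that is why the training flag is not forwarded in this call.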

Thank you. It works now.