Exercise - 4 Transformer

Mano_Bharathi_M · June 10, 2023, 1:15am

Programming Assignment: Transformers Architecture with TensorFlow
I have been stuck in this one assignment for past 2 days… will anyone can help me to solve this ? or can mentors reset my cassignment

TMosh · June 10, 2023, 2:28am

It appears the first problem is with your scaled_dot_product_attention() function. The error message tells you it has to do with how it uses the mask parameter.

Given that error, none of the other functions will work correctly.

The grader tests your code with different data than is used in the notebook.

Mano_Bharathi_M · June 10, 2023, 2:50am

# moderator edit: code removed

Mano_Bharathi_M · June 10, 2023, 2:58am

can you reset my lab to the initial version. I think i messed up it…

paulinpaloalto · June 10, 2023, 3:43am

You can do that yourself. The first topic on the DLS FAQ Thread shows how. The mentors can’t do it for you in any case.

TMosh · June 10, 2023, 4:09am

Please don’t post your code on the forum. That isn’t allowed by the Code of Conduct.
I’ll edit your message to remove the code.

Hints:

Try using the last dimension of ‘k’ to get the shape. Not always dimension 0.
Read the “Reminder” that’s part of the Exercise 3 instructions.
You haven’t specified an axis for the softmax activation to use.

Topic		Replies	Views
[Week 4] scaled_dot_product_attention Sequence Models	11	1329	May 7, 2023
C5_W4_A1_Transformer_Subclass_v1 Scaled_dor_product_attention Sequence Models	11	809	August 23, 2021
A problem with the Programming Assignment: Transformers Architecture with TensorFlow Sequence Models week-4	1	41	November 15, 2024
Transformer Summarizer C4W2_Assignment Exercise 1 - scaled_dot_product_attention NLP with Attention Models week-2	3	214	April 30, 2024
Help needed for C5W4A1 EX-3 Sequence Models week-4	4	49	August 17, 2024

Exercise - 4 Transformer

Related topics