C4W2-Exercise2

Hi @pongyuenlam

First of all, thank you for putting in the effort to understand what you are learning. It feels great as a mentor to come across learners who put in sincere effort. Once you complete NLP, do go through the Deep Learning Specialisation as well. Believe me, you will not regret it.

Now, coming to your code, I will go through it step by step; a small sketch of the overall pattern follows the list.

  1. After calculating self-attention for block 1 (your code there was correct), you had to apply layer normalization (layernorm1) to the sum of the attention output and the input.
    You called the correct layer, but there are two mistakes: you did not need to use tf.add, and you should sum the attention output and x with the plain addition operator, whereas you passed them as ((mult_attn_out1, x).

  2. Read the instruction for applying layer normalization here: you had to apply it to the sum of the attention output and the output of the first block, but you have summed attention output 1 and attention output 2, which is incorrect. Also, the same mistake as in point 1: do not use tf.add; use the addition operator to add mult_attn_out2 and Q1, which is the output of the first block.

  3. BLOCK 3: the next instruction was to pass the output of the second block through an FFN, but you added a padding mask to block 3, which is not required here.

  4. Next, the code instructions mention:
    apply a dropout layer to the ffn output
    use training=training
    But you missed adding training=training while applying the dropout layer to the ffn output.

  5. Again, to apply layer normalization (layernorm3) to the sum of the ffn output and the output of the second block, please remove tf.add and use the addition operator on the outputs mentioned in the instruction, which you had chosen correctly.
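To tie the five points together, here is a minimal sketch of the "residual add, then layer norm" pattern the instructions describe, written against plain Keras layers. The class name DecoderLayerSketch, the constructor arguments, and the mask handling are my own assumptions for illustration; only the block structure and names such as mult_attn_out1, Q1, layernorm1-3, and training=training come from the points above, so adapt it rather than copy it (your graded class, for instance, also has to return the attention weights).

```python
import tensorflow as tf

# Illustrative sketch only: shows the add + layernorm pattern from points 1-5,
# not the assignment solution.
class DecoderLayerSketch(tf.keras.layers.Layer):
    def __init__(self, embedding_dim=128, num_heads=8,
                 fully_connected_dim=512, dropout_rate=0.1):
        super().__init__()
        self.mha1 = tf.keras.layers.MultiHeadAttention(num_heads, embedding_dim)
        self.mha2 = tf.keras.layers.MultiHeadAttention(num_heads, embedding_dim)
        self.ffn = tf.keras.Sequential([
            tf.keras.layers.Dense(fully_connected_dim, activation="relu"),
            tf.keras.layers.Dense(embedding_dim),
        ])
        self.layernorm1 = tf.keras.layers.LayerNormalization(epsilon=1e-6)
        self.layernorm2 = tf.keras.layers.LayerNormalization(epsilon=1e-6)
        self.layernorm3 = tf.keras.layers.LayerNormalization(epsilon=1e-6)
        self.dropout_ffn = tf.keras.layers.Dropout(dropout_rate)

    def call(self, x, enc_output, training, look_ahead_mask, padding_mask):
        # Block 1: masked self-attention, then layernorm1 on (attention output + input).
        mult_attn_out1 = self.mha1(x, x, x, attention_mask=look_ahead_mask)
        Q1 = self.layernorm1(mult_attn_out1 + x)   # plain +, no tf.add

        # Block 2: cross-attention with the encoder output, then layernorm2 on
        # (attention output + output of the first block).
        mult_attn_out2 = self.mha2(Q1, enc_output, enc_output,
                                   attention_mask=padding_mask)
        out2 = self.layernorm2(mult_attn_out2 + Q1)

        # Block 3: pass the second block's output through the FFN (no mask here),
        # apply dropout with training=training, then layernorm3 on (ffn output + out2).
        ffn_output = self.ffn(out2)
        ffn_output = self.dropout_ffn(ffn_output, training=training)
        return self.layernorm3(ffn_output + out2)
```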

Regards
DP
