I am completely stuck at Course 5 Week 4, Exercise 4 (UNQ_C4). I have spent a lot of time reading through the Keras documentation, going through the function step by step, trying several combinations based on the instructions provided, etc., but I have no idea how to get the call() function to work.
At this point, I am completely frustrated and close to calling it quits. I’ve managed to work through all the prior assignments, from Course 1 through Course 5 Week 3, on my own (after a bit of struggling in some cases), but I feel like I’ve hit an impenetrable wall this time around.
I don’t want to post my code, as that would likely go against the Honor Code, but I think my issue is with the initial call to the self.mha() layer.
Lots of students have difficulty with this assignment - it isn’t very well-written.
Have you tried searching on the forum here for posts from other students? There has been a lot of discussion about it.
/// Update 11/2022 ///
For self-attention, you call self.mha(…) with x as all three of the query, key, and value arguments, and you also pass the mask. This is discussed in Sections 3, 4, and 4.1.
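To make that concrete, here is a minimal, generic sketch of a self-attention call with tf.keras.layers.MultiHeadAttention. This is not the assignment's solution; the layer configuration and tensor shapes are purely illustrative.

```python
import tensorflow as tf

# Illustrative layer and inputs (the shapes and sizes are made up for this example)
mha = tf.keras.layers.MultiHeadAttention(num_heads=8, key_dim=64)

x = tf.random.uniform((2, 5, 64))   # (batch_size, seq_len, embedding_dim)
mask = tf.ones((2, 5, 5))           # attention mask, broadcastable over the heads dimension

# Self-attention: the same tensor x is passed as query, value, and key,
# and the mask goes in through the attention_mask argument.
attn_output = mha(query=x, value=x, key=x, attention_mask=mask)
print(attn_output.shape)            # (2, 5, 64)
```

Note that the positional argument order of this layer's call is (query, value, key), which is why using keyword arguments, as above, is the safer habit.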
Thanks @TMosh! I tried looking through some of the existing threads but couldn’t find anything that was immediately helpful to get me unstuck.
I can give it another go (after my brain decompresses a bit), but I’m not sure how that will help! I would appreciate other suggestions!
@santoshsastry I’m stuck there too. This assignment is by far the most challenging, and it lacks some of the typical hints in the code that would help a little more.
So far, reading the Transformer documentation has helped me a lot.
@kleber - yeah, it is challenging, and the notebook documentation is not very helpful. I had to spend a ton of time reviewing the TF/Keras documentation and looking through examples to understand how to proceed. I managed to complete the course successfully, but it took a lot more effort and time than I had anticipated.
I also managed to complete it. But I must say this last week’s topic is incredibly complex. Reading the Transformer/Encoder/Decoder documentation is absolutely essential for this assignment. I think it could be better designed so that students could absorb the concepts more concretely. I’ll think about how to give good feedback on it.
Hi @TMosh
I got this error and have been stuck here for a long time. I read a blog post that suggests using the mask in the attention layer, but I don’t know how. Could you please help me?
Your answer saved my day. But why is the mask passed differently from the training parameter?
I don’t want to post the code here, but this is very strange to me.
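On the mask vs. training question, here is a rough sketch of the distinction, assuming the standard tf.keras.layers.MultiHeadAttention and Dropout layers (the class and argument names below are illustrative, not taken from the assignment): training is a Keras-level flag that layers such as Dropout use to switch between training and inference behaviour, whereas attention_mask is an ordinary, layer-specific argument of MultiHeadAttention, so it has to be passed explicitly to that particular call.

```python
import tensorflow as tf

# Hypothetical minimal block illustrating the difference between
# the `training` flag and an explicit attention mask.
class TinyBlock(tf.keras.layers.Layer):
    def __init__(self, embed_dim=64, num_heads=2, rate=0.1):
        super().__init__()
        self.mha = tf.keras.layers.MultiHeadAttention(num_heads=num_heads, key_dim=embed_dim)
        self.dropout = tf.keras.layers.Dropout(rate)

    def call(self, x, training=None, mask=None):
        # The mask is data: it goes to the one layer that needs it,
        # through MultiHeadAttention's attention_mask argument.
        attn_out = self.mha(query=x, value=x, key=x, attention_mask=mask)
        # training is a framework-level flag: Dropout (and similar layers)
        # use it to behave differently during training vs. inference.
        return self.dropout(attn_out, training=training)
```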