Week 4 - Assignment 1 - Exercise 4

BrynjarGeir · May 31, 2021, 3:34pm

Hi, I’m having a bit of a problem with the Encoding Layer exercise. I don’t really know how to call the functions. I think I’ve read through the text and don’t quite get it. For the mha I first thought that I should pass x.shape[0], x.shape[1],x.shape[2] and then after that (x) or some sort of version of that. The shape/value inside mha and then pass x. But that doesn’t seem to be working. I quess I would run in to the same problem with the other calls where there is a comment at the end of the line like

(batch_size, input_seq_len, embedding_dim)

Anybody have any idea of what is the right way to go here? For the dropout I’m just using training and the passing the output from prev. Should I also have add x and attn_output together before passing them as arguments and do the same when in the second layer normalization but just with different arguments? I just don’t know what we are supposed to pass as arguments. Any help or input would be appreciated.

kvamvake · May 31, 2021, 6:41pm

I have the same issue

BrynjarGeir · May 31, 2021, 9:19pm

Ok so I was googling about and found the solution pretty much. I was trying to google how to call the mha and didn’t find anything for some time until I did. I was always using the shape of x but you should just use x. Don’t know why and wasn’t looking for the solution. I was googling somehting like:

how to use multiheadattention custom layer examples

And eventually I got to a tutorial on the Keras page which has this exact function. Again wasn’t looking for that exact thing but rather how to call mha in general. But I found it and couldn’t unsee it. You can find it the same way, kvamvake, but hopefully someone can help explain why this is the case.

Topic		Replies	Views
W4_Ex-4 \| (Encoder) - Stuck with no clue what to do next! Please help! Sequence Models coursera-platform	14	1673	November 9, 2022
A perspective for C5W4A1 EX4 Sequence Models week-4 , coursera-platform	3	113	October 2, 2024
C5_W4_A1 Exercise 4 Encoder Layer Sequence Models coursera-platform	15	1125	July 12, 2023
C5 W4: Exercise 4 EncoderLayer() At least need to know Sequence Models coursera-platform	13	638	December 3, 2022
Week 4 Assignment Encoder Wrong Values Sequence Models coursera-platform	3	799	October 15, 2022

Week 4 - Assignment 1 - Exercise 4

Related topics