A perspective for C5W4A1 EX4

Hi,

Having gone through this assignment, I’d like to share my perspective in the hope that it helps other students.

When trying to solve this exercise, work through the following things step by step. Firstly,

[diagram from the assignment]

This diagram has been provided. Please go through it carefully; merely by understanding the flow, 50% of the problem is solved already.
The next section you should look at closely is this:

[screenshot of the relevant section]

Then the next:

[screenshot of the following section]
And finally, carefully read the helper comments just before each line of code you need to write.
I feel that if students follow this flow, it’ll be much easier. Part of the problem lies in:

  1. Lack of understanding of the flow, i.e., which part is supposed to go where.
  2. Confusion about how to call the functions.
    One more hint: for the first line, the call goes like this (a standalone sketch follows right after this list):
    self.mha(parameter_1=<input variable>, parameter_2=<input variable>, parameter_3=<input variable>, mask_parameter=<mask variable>)
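To make that shape concrete without spoiling the assignment’s exact variable names, here is a minimal, self-contained sketch using only the public tf.keras.layers.MultiHeadAttention API (which is what self.mha is an instance of in this exercise). The layer sizes, tensor shapes, and the mask below are toy values I made up, not the ones from the notebook:

```python
import tensorflow as tf

# Toy self-attention call: query, value, and key are all the same tensor,
# and the attention mask is passed as a keyword argument.
mha = tf.keras.layers.MultiHeadAttention(num_heads=2, key_dim=64)

x = tf.random.uniform((1, 5, 64))   # (batch, seq_len, embedding_dim) -- made-up shape
mask = tf.ones((1, 5, 5))           # toy mask; 1 means "this position may be attended to"

attn_output = mha(query=x, value=x, key=x, attention_mask=mask)
print(attn_output.shape)            # (1, 5, 64)
```

In the Keras docs the positional order is query first, then value, then key, with the mask as a keyword argument; knowing which keyword each argument corresponds to is most of what the hint above is pointing at.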

Further, before you start coding, take a screenshot of the diagram and keep it off to one side. Then, as you write code, keep referring to it so that the correct flow stays in mind. When I was solving this assignment, this last step was what stood between me and success.
Hopefully this is of help to someone.

Reference: C5_W4_A1 DON'T PANIC! Transformer help is on the way

I’d suggest going through the reference first and then my perspective. That’s just my suggestion; you could do whatever you like first :smile:


Thanks for your recommendations.


Thanks for sharing.


On my first pass through the assignment, I somehow skipped over the second paragraph you highlighted here (the one that explained the arguments that need to be passed to the self.mha() function). Your post reminded me to go back and re-read it carefully, and that got me unstuck. Thanks!

(Personally, I also found it helpful to read this documentation, which explains what the call() function does. After reading that, I was able to understand the rest of the Keras layer documentation, including MultiHeadAttention.)
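If the docs still feel abstract, here is a tiny, made-up example of the pattern they describe: a custom Keras layer defines its forward pass in call(), and invoking the layer instance like a function is what triggers call(). The layer name and numbers below are purely illustrative:

```python
import tensorflow as tf

class ScaleLayer(tf.keras.layers.Layer):
    """Toy layer (made up for illustration): multiplies its input by a constant."""

    def __init__(self, factor=2.0):
        super().__init__()
        self.factor = factor

    def call(self, inputs):
        # The forward pass lives here; you normally don't invoke call() directly.
        return inputs * self.factor

layer = ScaleLayer()
out = layer(tf.constant([1.0, 2.0, 3.0]))  # calling the instance runs call()
print(out.numpy())                          # [2. 4. 6.]
```

MultiHeadAttention follows the same pattern: its forward pass is a call(query, value, key=None, attention_mask=None, ...) method, which is why self.mha(...) in the assignment takes those arguments.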
