I’ve been stuck for a while on Course 5, Week 4, on EncoderLayer(), like many others in this forum. It seems obvious that this week should have been removed or reworked, because there is very little guidance or help available for it.
I partially agree with you on this. Prof Andrew indeed spends a little less time teaching the ins and outs of Transformers compared to the other concepts in this specialization, but at the same time he wants learners to practise figuring out a concept by themselves. Assuming a learner has worked through all the previous courses and the earlier weeks of the 5th course meticulously, the expectation that interested learners will delve deeper into Transformers on their own makes complete sense to me. Also, there is only so much Prof Andrew can cover in a single course; otherwise, learners would lose interest and never finish it anyway.
As for this, the reason is pretty straightforward. If we posted solutions in the public forums, learners would simply refer to them whenever they got stuck, and no real learning would happen. If that were the goal, why have the assignments in the first place? We could simply have included notebooks with all the code already filled in.
Now, coming to your query: the error asserts that your implementation of the EncoderLayer is wrong. Can you please DM me your implementation so I can take a look?
I was able to fix the problem. For anyone who runs into the same issue, pay close attention to this hint from the notebook:
# apply layer normalization on sum of the output from multi-head attention (skip connection) and ffn output to get the
# output of the encoder layer (~1 line)
These forums are supposed to have answers. I am at least the third person to have this problem, and this thread says nothing about how to solve it. I have the exact same problem as Rafael. Could you also leave a clue or an answer here so others who hit this problem can understand why, please?
For anyone who is having the same problem, I found the solution: for the second layer normalization, I was adding self_mha_output to ffn_output instead of adding skip_x_attention to ffn_output.
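To make the fix concrete, here is a minimal sketch of an encoder layer in TensorFlow/Keras, assuming the setup used in the assignment. The variable names self_mha_output, skip_x_attention, and ffn_output follow the notebook's hints; the constructor arguments and layer names are illustrative, not the exact graded code.

```python
import tensorflow as tf

class EncoderLayer(tf.keras.layers.Layer):
    """Sketch of a Transformer encoder layer: self-attention + feed-forward,
    each followed by a skip connection and layer normalization."""

    def __init__(self, embedding_dim=128, num_heads=8, fully_connected_dim=512,
                 dropout_rate=0.1, layernorm_eps=1e-6):
        super().__init__()
        self.mha = tf.keras.layers.MultiHeadAttention(
            num_heads=num_heads, key_dim=embedding_dim)
        self.ffn = tf.keras.Sequential([
            tf.keras.layers.Dense(fully_connected_dim, activation="relu"),
            tf.keras.layers.Dense(embedding_dim),
        ])
        self.layernorm1 = tf.keras.layers.LayerNormalization(epsilon=layernorm_eps)
        self.layernorm2 = tf.keras.layers.LayerNormalization(epsilon=layernorm_eps)
        self.dropout_ffn = tf.keras.layers.Dropout(dropout_rate)

    def call(self, x, training=False, mask=None):
        # Self-attention on the input (query = value = key = x)
        self_mha_output = self.mha(x, x, x, attention_mask=mask)

        # First skip connection: add the layer input x to the attention output,
        # then apply the first layer normalization
        skip_x_attention = self.layernorm1(x + self_mha_output)

        # Feed-forward block, with dropout applied during training
        ffn_output = self.ffn(skip_x_attention)
        ffn_output = self.dropout_ffn(ffn_output, training=training)

        # Second skip connection: add skip_x_attention (NOT self_mha_output)
        # to the ffn output, then apply the second layer normalization
        encoder_layer_out = self.layernorm2(skip_x_attention + ffn_output)
        return encoder_layer_out
```

The key point is that each residual connection wraps only its own sub-block: the second one adds the output of the first layer norm (skip_x_attention) to the feed-forward output, which is exactly the line the hint above refers to.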
Hey @william27,
Apologies for the delay and for your poor experience, and thanks a ton for sharing these insights with the community.
Please note that we try our best to answer as many queries as we can, but with the volume of questions, we as mentors sometimes miss some of them, for various reasons. We hope you won’t have to go through the same experience again.