In version “v1.6” of the C5_W4_A1_Transformer_Subclass_v1 notebook, there is an error in the comments of Part 4’s coding exercise that is bound to trip people up.
Where Exercise 4 says “pass the output of the multi-head attention layer through a ffn (~1 line)”, it should say “pass the output of the normalized multi-head attention layer through a ffn (~1 line)”.
Maybe that alone would make it clearer, but here’s the comment in the template code for the line immediately before it:
# skip connection
# apply layer normalization on sum of the input and the attention output to get the
# output of the multi-head attention layer (~1 line)
In other words, what they mean by “the output of the multi-head attention layer” is only ambiguous if you missed the point of the previous comment and the diagram in Figure 2a.
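For anyone who still finds the wording confusing, here is a minimal, self-contained sketch of just this step using tf.keras layers. The layer names (mha, layernorm1, ffn) mirror the template’s conventions but are my assumptions here, and the dimensions are illustrative; this is not the assignment’s exact code:

```python
import tensorflow as tf

# Illustrative hyperparameters, not the assignment's values
embedding_dim, num_heads, fully_connected_dim = 12, 2, 32

mha = tf.keras.layers.MultiHeadAttention(num_heads=num_heads, key_dim=embedding_dim)
layernorm1 = tf.keras.layers.LayerNormalization(epsilon=1e-6)
ffn = tf.keras.Sequential([
    tf.keras.layers.Dense(fully_connected_dim, activation="relu"),
    tf.keras.layers.Dense(embedding_dim),
])

x = tf.random.uniform((1, 5, embedding_dim))  # (batch, seq_len, embedding_dim)

attn_output = mha(x, x)             # self-attention: query = value = x
out1 = layernorm1(x + attn_output)  # skip connection + layer norm; this normalized
                                    # sum is "the output of the multi-head
                                    # attention layer" the comment refers to
ffn_output = ffn(out1)              # the ffn takes the NORMALIZED output,
                                    # not attn_output
```

The whole point is simply that the ffn consumes out1 (the normalized sum), not the raw attn_output.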
But more clarity is never a bad thing.
…