Hi,
I found the following typos in the “Programming Assignment: Transformers Architecture with TensorFlow”:
In the section describing sequence lengths, the final example suddenly has 4 truncated sequences instead of 3:
which might get vectorized as:
[[ 71, 121, 4, 56, 99, 2344, 345, 1284, 15],
[ 56, 1285, 15, 181, 545],
[ 87, 600]
]
When passing sequences into a transformer model, it is important that they are of uniform length. You can achieve this by padding the sequence with zeros, and truncating sentences that exceed the maximum length of your model:
[[ 71, 121, 4, 56, 99],
[ 2344, 345, 1284, 15, 0],
[ 56, 1285, 15, 181, 545],
[ 87, 600, 0, 0, 0],
]
Sequences longer than the maximum length of five will be truncated
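For what it's worth, padding and truncating the three vectorized sequences above to a maximum length of five should produce three rows, not four. A minimal sketch with tf.keras.preprocessing.sequence.pad_sequences (my own illustration, not code from the assignment) gives the expected output:

import tensorflow as tf

# The three vectorized sequences from the example above
sequences = [
    [71, 121, 4, 56, 99, 2344, 345, 1284, 15],
    [56, 1285, 15, 181, 545],
    [87, 600],
]

# Pad with zeros and truncate to the maximum length of five;
# 'post' keeps the start of long sequences and pads at the end
padded = tf.keras.preprocessing.sequence.pad_sequences(
    sequences, maxlen=5, padding="post", truncating="post"
)
print(padded)
# [[  71  121    4   56   99]
#  [  56 1285   15  181  545]
#  [  87  600    0    0    0]]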
Extra 'in' highlighted below:
5 - Decoder
The Decoder layer takes the K and V matrices generated by the Encoder and *in* computes the second multi-head attention layer with the Q matrix from the output (Figure 3a).
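(Aside: the cross-attention step that sentence describes, with Q from the decoder and K/V from the encoder output, could be sketched in TensorFlow roughly as below; the layer sizes and tensor shapes are placeholders of mine, not the assignment's:)

import tensorflow as tf

# Toy shapes: batch of 2, target length 4, source length 6, model dim 128
decoder_hidden = tf.random.uniform((2, 4, 128))  # Q comes from the decoder
enc_output = tf.random.uniform((2, 6, 128))      # K and V come from the encoder

# Second multi-head attention block of the Decoder layer
mha2 = tf.keras.layers.MultiHeadAttention(num_heads=8, key_dim=16)
attn = mha2(query=decoder_hidden, value=enc_output, key=enc_output)
print(attn.shape)  # (2, 4, 128)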
Finally, in the code comment, there is an extra 'is':
class Decoder(tf.keras.layers.Layer):
"""
The entire Encoder is starts by passing the target input to an embedding layer
and using positional encoding to then pass the output through a stack of