Error in practice quiz question 10

CWKoo · January 21, 2023, 11:59am

The correct answer for this question asserts that the BLEU score is better for the original transformer than for the reversible layer. However, the opposite is true, according to the lecture video (5:11) and the reformer paper (see Table 4).

arvyzukai · January 23, 2023, 7:02am

Hi @CWKoo

Welcome to community

Well… it’s quite ambiguous - the video mentions (5:27) “… It’s really because there’s been some hyperparameter tuning into three years since the original transformer paper was published. …” I personally interpret that as the Reformer having the advantage of time (2020 vs. 2017-2018).

and also in the Reformer paper the “big” model has better scores:

In the Lecture video Reformer is compared to 2017 version (and strangely not to 2018).

Theoretically Reformer should not outperform regular Transformers on these quality metrics (since Reformer is optimized for faster training/inference and less memory requirements while loosing minimally on quality).

But the way the Quiz question Nr. 10 is formulated is actually the opposite of this (or at least ambiguous) - for me, it suggests that the Reformer is the older architecture by 3 years than the regular Transformer (which is obviously not true) and this is the reason why it has better scores…

I will submit for a better formulation of the question.

Thanks for bringing this up.
Cheers

Topic		Replies	Views
Reversible Transformer NLP with Attention Models week-4	1	395	September 19, 2023
What about Reformer, can this architecture help? Generative AI with Large Language Models week-1	4	425	October 24, 2023
Reversible Residual Layers: Cannot understand y_2 = x_2 + FeedFwd(y_1) NLP with Attention Models week-4	8	530	September 4, 2023
What is major difference between BLEU score and BLEU modified NLP with Attention Models week-1	6	319	April 8, 2024
Not understanding how Reversible Layers in the Reformer saves memory NLP with Attention Models week-4	3	362	September 4, 2023

Error in practice quiz question 10

Related topics