model = tl.Serial(
    tl.ShiftRight(mode=mode),                               # Shift the input right by one position
    tl.Embedding(vocab_size=vocab_size, d_feature=d_model), # Token embedding layer
    [tl.GRU(n_units=d_model) for _ in range(n_layers)],     # n_layers stacked GRU layers, each with d_model units
    tl.Dense(n_units=vocab_size),                           # Project back to vocabulary size
    tl.LogSoftmax()                                         # Log-probabilities over the vocabulary
)
It took me a while to understand the purpose of this layer, but I think I get it now. Note that, in the Week 2 assignment, the inputs and targets are the same. If the input were “I am hungry” and the target were also “I am hungry,” the RNN would have the easy task of simply copying the input to the output at every step. The ShiftRight layer prepends a padding token and drops the last token, changing the input to " I am hungry." Thus the RNN now tries to predict “I” from “”, “am” from " I", and “hungry” from " I am."
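To make this concrete, here is a small NumPy sketch of what ShiftRight does to a batch of token IDs (this mimics the layer's behavior rather than calling Trax itself; the token IDs and the `shift_right` helper are made up for illustration):

```python
import numpy as np

def shift_right(x, n_positions=1, pad_value=0):
    # Mimic trax.layers.ShiftRight: prepend pad tokens along the
    # sequence axis and drop the same number of tokens from the end,
    # so the sequence length is unchanged.
    pad = np.full(x.shape[:-1] + (n_positions,), pad_value, dtype=x.dtype)
    return np.concatenate([pad, x], axis=-1)[..., :x.shape[-1]]

# Hypothetical IDs for "I am hungry"
tokens = np.array([[12, 7, 9]])
print(shift_right(tokens))  # [[ 0 12  7]]
```

So at each step the model sees only the tokens before the one it must predict: pad → “I”, pad “I” → “am”, pad “I am” → “hungry”.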