tl.ShiftRight layer is by default nested inside the Serial Combinator

In pre_attention_decoder_fn and in other functions, I am not able to use tl.ShiftRight layer is by default nested inside the Serial Combinator. What to do? Anyone can please help.
pip install trax installed trax 1.4.1

Hi @shubhsr

I have encountered the issue in the past and cannot quite remember how I solved it. I think it was something with layer initialization or input size, but I’m really not sure :slight_smile:

By quick search I can find these issues were posted:

I cannot pinpoint the exact problem and if I remember how I got around it I will post it.

Cheers

The solution worked, but still, the same problem is in tl.AttentionQKV(…) in NMTAttn function while running the AttentionQKV layer