How to determine the input format of tensor for the evaluation/prediction mode in Trax

David_C1 · June 26, 2023, 3:05am

Hi all,

For C4_W2_Assignment in the Natural Language Processing Specialization, I found it kind of confusing for determining the input for the transformer model(See attached fig). I think originally for training the model, we need three arrays (X,Y and mask), while here the input only has a dimension of two??? We should we neglect the mask array?

More in general, is there a way to better understanding the input-dimension and output-dimension?

gent.spah · June 26, 2023, 8:09am

This is complex model with many parts and you need to go through from the beginning in order to understand how those parts are put together to arrive at the explained output.

arvyzukai · June 26, 2023, 8:18am

Hi @David_C1

There was a previous similar question that might suggest you some ideas.

Cheers

David_C1 · June 26, 2023, 7:11pm

Hi Gent, thanks for replying. I wonder is there a systematic way to understand the input/output parameters of each layer or the whole model in Trax? Thanks!

David_C1 · June 26, 2023, 7:11pm

Hi ArvyZukai, thanks for replying. I wonder is there a systematic way to understand the input/output parameters of each layer or the whole model in Trax? Thanks!

arvyzukai · June 27, 2023, 6:35am

Hi @David_C1

I’m not sure I can better explain than the trax documentation (especially “Layers are trainable.”, " Layers combine into layers." sections and “2. Inputs and Outputs”).

You can play around in your Coursera labs by trying to initialize simple tl.Dense() layer with your own input (don’t forget to initialize with input.signature).

Cheers

gent.spah · June 27, 2023, 7:11am

Hello @David_C1 you could follow up through the notebook and try understand as much as you can. Other external resources that Arvydas suggests can help also.

Topic		Replies	Views
UNQ_C9: Model input NLP with Attention Models week-2	6	519	June 27, 2023
Question about using NN to predicting sentiment NLP with Attention Models week-1	1	316	September 28, 2023
Trax and mean layer NLP with Sequence Models week-1	4	572	December 3, 2022
W4 Assignment 1 Exercise 8 Are the input dimensions of our transformer model correct Sequence Models week-4	2	249	January 9, 2024
Where can I find the reference on what input should be passed to the model in the evaluation step? NLP with Sequence Models week-4	5	542	November 17, 2022

How to determine the input format of tensor for the evaluation/prediction mode in Trax

Related topics