C3_W4 UNQ_C5 : problem with loading the weights

Florent_Tho · September 16, 2022, 10:10am

Hello,

When I try to compute the accuracy with the Siamese model, I get an error that I can’t resolve (see the trace below).

I suppose there is a discrepancy between the model I built and the shape of the weights in the pretrained model.
I tried reloading the notebook several times, to no avail.

How can I investigate this error and find the problem?
I didn’t find much that was useful on the web.

LayerError: Exception passing through layer Parallel (in pure_fn):
  layer created in file [...]/<ipython-input-20-54e6716dd7ce>, line 29
  layer input shapes: ShapeDtype{shape:(512, 64), dtype:int64}

  File [...]/trax/layers/base.py, line 707, in __setattr__
    super().__setattr__(attr, value)

  File [...]/trax/layers/base.py, line 454, in weights
    f'Number of weight elements ({len(weights)}) does not equal the

ValueError: Number of weight elements (512) does not equal the number of sublayers (2) in: Parallel_in2_out2[
  Serial[
    Embedding_41699_128
    LSTM_128
    Mean
    Normalize
  ]
  Serial[
    Embedding_41699_128
    LSTM_128
    Mean
    Normalize
  ]
]

Florent_Tho · September 16, 2022, 1:12pm

I’m really puzzled.
It seems that the way q1 and q2 are defined is causing the problem.

For example:

reinoudbosch · September 20, 2022, 1:34am

Hi Florent_Tho,

The comment above the invocation of the data generator states

“# use batch size chuncks of questions as Q1 & Q2 arguments of the data generator. e.g x[i:i + batch_size]”

Are you doing this in your code? The example you give [0:512] with a batch size of 10 does not conform to this comment.
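
The slicing that comment describes can be sketched as a loop over the start index i. This is only an illustration: batch_chunks is a hypothetical helper that is not part of the assignment, and short lists of strings stand in for test_Q1 and test_Q2.

```python
def batch_chunks(Q1, Q2, batch_size):
    """Yield consecutive (Q1, Q2) slices of at most batch_size items.

    Hypothetical helper for illustration only; not from the notebook.
    """
    for i in range(0, len(Q1), batch_size):
        # Same slicing pattern as in the notebook comment: x[i:i + batch_size]
        yield Q1[i:i + batch_size], Q2[i:i + batch_size]


# Stand-in data: 25 question pairs, processed 10 at a time.
test_Q1 = [f"q1_{k}" for k in range(25)]
test_Q2 = [f"q2_{k}" for k in range(25)]

chunk_sizes = [len(c1) for c1, _ in batch_chunks(test_Q1, test_Q2, batch_size=10)]
print(chunk_sizes)  # [10, 10, 5]
```

Each slice holds at most batch_size items, so a slice such as [0:512] only matches this pattern when batch_size is 512.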

zzhu24 · October 2, 2022, 5:31am

I have run into the exact same issue. I believe I followed the comment “# use batch size chuncks of questions as Q1 & Q2 arguments of the data generator. e.g x[i:i + batch_size]” correctly.

Here is the data generator from my code:
q1, q2 = next(data_generator(test_Q1[i:i + batch_size], test_Q2[i:i + batch_size],pad =vocab[‘’], batch_size=batch_size,shuffle=False))

Then, when I call model(q1,q2), the same error occurs.

My lab ID is nhbjjhab.

ARNAV_GUPTA1 · October 2, 2022, 11:04pm

The hint given for this exercise is: use vocab['<PAD>'] for the pad argument of the data generator. You are using pad = vocab[‘’]. I think following the hint will solve the problem.

zzhu24 · October 3, 2022, 3:37am

Thanks for the reply, but I was using vocab['<PAD>']. Sorry, it did not come through correctly when I copied and pasted it.

The screenshot below is a test to replicate the issue, and it hits the exact same error.
I call next() on the generator to get the q1 and q2 inputs, but when I call the model with q1 and q2 it fails.

anchyzas · October 10, 2022, 10:41am

Hey, I just solved the exact same problem. Try passing them to the model as a single tuple, with two pairs of parentheses, like so:

model((temp1, temp2))

glicerico · October 17, 2022, 4:37am

It’s a shame there isn’t a single example of how to run trax’s Parallel model for inference. But @anchyzas gave the right answer: pass both question batches as one tuple (or list), since Parallel takes both inputs as its first argument.
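
A likely reason for the wording of the error, assuming trax layers are called as layer(x, weights=None, state=None, rng=None): in model(q1, q2) the second positional argument lands in the weights slot, so a batch of 512 rows gets compared against the 2 sublayers. The toy class below is a hypothetical stand-in that only mimics that call signature. It is not trax code and does not run the Siamese model.

```python
class ToyParallel:
    """Hypothetical stand-in for a two-branch layer (NOT the trax class).

    It only mimics the assumed call signature: inputs first, weights second.
    """

    def __init__(self, n_sublayers=2):
        self.n_sublayers = n_sublayers
        self.weights = [None] * n_sublayers

    def __call__(self, x, weights=None):
        if weights is not None:
            # Anything passed second is treated as weights, one entry per sublayer.
            if len(weights) != self.n_sublayers:
                raise ValueError(
                    f"Number of weight elements ({len(weights)}) does not equal "
                    f"the number of sublayers ({self.n_sublayers})"
                )
            self.weights = list(weights)
        q1, q2 = x  # both inputs travel together in the first argument
        return len(q1), len(q2)


model = ToyParallel()

# Stand-in batches with the shape from the trace: 512 rows of 64 token ids.
q1 = [[0] * 64 for _ in range(512)]
q2 = [[0] * 64 for _ in range(512)]

print(model((q1, q2)))  # one tuple argument: both batches reach the layer

try:
    model(q1, q2)  # q2 lands in the weights slot
except ValueError as err:
    print(err)
```

In this sketch the one-tuple call succeeds, while the two-argument call fails with a message of the same form as the one in the trace.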

joaopedrovtp · November 29, 2022, 4:52pm

Yeah, that solved my problem too! Thanks!

leggard · September 13, 2023, 1:22pm

Thanks, this helped me too!

O1ena · October 25, 2023, 8:22pm

Thank you!!
