C4W1_Assignment : Tensor of logits dimension mismatch

WASIOU_JAHARABI · February 16, 2024, 5:12pm

Tensor of contexts has shape: (64, 14, 256)
Tensor of right-shifted translations has shape: (64, 15)
Tensor of logits has shape: (64, 12000)

Expected Output

Tensor of contexts has shape: (64, 14, 256)
Tensor of right-shifted translations has shape: (64, 15)
Tensor of logits has shape: (64, 15, 12000)

I am unable to debug why I am getting a logit shape of (64, 12000). Where could it have gone wrong?

paulinpaloalto · February 16, 2024, 5:39pm

The difference between a 2D tensor and a 3D tensor is pretty fundamental, right? So you must be misinterpreting the operations that the math formulas are telling you to do. One way to debug this is to add print statements in the relevant parts of the code to print the shapes of all the objects. That should at least let you narrow it down to the line of code that is incorrect.

I’m not familiar with the NLP C4 material, but have done the DLS equivalent in DLS C5 W4. One thing to be careful about is the notational conventions for the difference between dot product style multiply and elementwise multiply. In DLS Prof Ng is consistent in that he always and only uses “*” as the operator to signify “elementwise” multiply. If he does not write an explicit operator between two tensors or array objects, then the operation is “dot product multiply”.

WASIOU_JAHARABI · February 16, 2024, 7:18pm

Thank you for your response. My actual confusion is in the C4W1_assignment’s decoder’s layers. I am guessing I have misplaced some parameters of the decoder’s layers as a result the tensor has a different dimension than expected and I have been trying to figure it out.

WASIOU_JAHARABI · February 16, 2024, 7:31pm

I figured it out. Thanks.

paulinpaloalto · February 16, 2024, 7:48pm

Nice work! Thanks for confirming.

ledai0912 · February 25, 2024, 11:41am

I’ve had the same problem, can you tell me how I can solve it ? Thank you so much !!!

Javier_EHS · February 28, 2024, 3:15pm

same here I get

“Tensor of logits has shape: (64, 12000)”

and I have no idea how to fix it

Javier_EHS · February 28, 2024, 3:31pm

finally I figured out, in the instructions it says:

" Post-attention LSTM. Another LSTM layer. For this one you don’t need it to return the state."

but I missinterpreted it as return_sequences=False when it should be return_sequences=True

hope this helps

WASIOU_JAHARABI · February 28, 2024, 3:32pm

In my case, I was not returning the sequences in the post attention rnn. please check that.

Topic		Replies	Views
Dimensional size error for C4W1_Assignment Decoder test NLP with Attention Models week-1	29	927	March 18, 2025
NLP C4W1 Exercise 3 Decoder NLP with Attention Models week-1	2	452	January 12, 2024
C4W1 - Ex3 - Incorrect third dimension of decoder output NLP with Attention Models week-1	1	299	February 29, 2024
C4W1 Assignment - Exercise 3 Decoder Function NLP with Attention Models week-1	6	357	May 24, 2024
Support with C4W1 assignment - NLP with attention models NLP with Attention Models feedback , week-1	2	206	May 31, 2024

C4W1_Assignment : Tensor of logits dimension mismatch

Expected Output

Related topics