I am struggling a lot with exercise 2: I cannot get rid of the following error, even after casting all intermediate results to tf.float64 where possible. I noticed that scores.dtype is float64 when I print it. Is that right?
This error is thrown by the line triplet_loss = tf.math.reduce_sum(triplet_loss1, triplet_loss2)
InvalidArgumentError: Value for attr 'Tidx' of double is not in the list of allowed values: int32, int64; NodeDef: {{node Sum}}; Op<name=Sum; signature=input:T, reduction_indices:Tidx -> output:T; attr=keep_dims:bool,default=false; attr=T:type,allowed=[DT_FLOAT, DT_DOUBLE, DT_INT32, DT_UINT8, DT_INT16, DT_INT8, DT_COMPLEX64, DT_INT64, DT_QINT8, DT_QUINT8, DT_QINT32, DT_BFLOAT16, DT_QINT16, DT_QUINT16, DT_UINT16, DT_COMPLEX128, DT_HALF, DT_UINT32, DT_UINT64]; attr=Tidx:type,default=DT_INT32,allowed=[DT_INT32, DT_INT64]> [Op:Sum] name:
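For reference, the signature in the message (reduction_indices:Tidx) seems to point at the axis argument of Sum, and a toy call that puts a float tensor in that slot reproduces the same error (the tensor values here are made up, just to show the mechanism):

```python
import tensorflow as tf

a = tf.constant([1.0, 2.0], dtype=tf.float64)
b = tf.constant([3.0, 4.0], dtype=tf.float64)

# The second positional argument of tf.math.reduce_sum is the axis
# (reduction_indices), which must be int32/int64, so a float64 tensor
# here raises the same InvalidArgumentError about attr 'Tidx'.
tf.math.reduce_sum(a, b)
```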
Do you have any idea what I might have done wrong in my function?
Thanks!
scores.dtype is mentioned for mask_exclude_positives, so that is where the correction is required. The instructions below should help you make the correction.
To create the mask, you need to check whether a cell is on the diagonal, by computing tf.eye(batch_size) == 1, or whether an off-diagonal cell is greater than the diagonal entry of its row, with negative_zero_on_duplicate > tf.expand_dims(positive, 1).
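In code, that combined condition might look roughly like the following sketch (names are the ones used in the notebook; batch_size is assumed to still be an integer here, since tf.eye expects an integer size, and the cast dtype is a choice rather than the graded answer):

```python
# Sketch: True where the cell is on the diagonal, or where an off-diagonal
# similarity is greater than the row's diagonal (positive) similarity,
# then cast so the mask can be combined arithmetically with the scores.
mask_exclude_positives = tf.cast(
    (tf.eye(batch_size) == 1)
    | (negative_zero_on_duplicate > tf.expand_dims(positive, 1)),
    scores.dtype,
)
```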
The instruction for closest_negative:
Remember that positive already has the diagonal values. Now you can use tf.math.reduce_max, row by row (axis=1), to select the maximum, which is closest_negative.
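As a sketch of that step: negative_without_positive below is a name I am introducing only for illustration, and the 2.0 offset assumes the scores are cosine similarities bounded by [-1, 1], so subtracting twice the mask pushes the excluded cells below any real similarity before the row-wise maximum is taken.

```python
# Illustration only: push the diagonal / duplicate-positive cells out of
# contention, then take the largest remaining similarity in each row.
negative_without_positive = negative_zero_on_duplicate - 2.0 * mask_exclude_positives
closest_negative = tf.math.reduce_max(negative_without_positive, axis=1)
```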
In my notebook the hint for closest_negative mentions axis=None instead of 1; is that wrong?
I created the mask as tf.cast((tf.eye(batch_size) == 1) | (negative_zero_on_duplicate > tf.expand_dims(positive, 1)), scores.dtype): should I cast the mask to tf.float64 directly rather than using scores.dtype?
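(For what it's worth, when scores.dtype is float64 the two casts should give identical tensors, so I suspect that choice alone cannot change the error; here is a quick check of what I mean:)

```python
mask_bool = (tf.eye(batch_size) == 1) | (negative_zero_on_duplicate > tf.expand_dims(positive, 1))
# If scores.dtype is tf.float64, these two casts should be element-wise identical.
same = tf.reduce_all(tf.cast(mask_bool, scores.dtype) == tf.cast(mask_bool, tf.float64))
```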
Or, since scores.dtype is also used when we define the new batch_size as tf.cast(tf.shape(v1)[0], scores.dtype), is that where I should make the correction?
Yep, the scores are indeed computed that way, using tf.linalg.matmul as recommended in the instructions. I wonder if I made a mistake when computing the new batch_size (I use the number of rows of v1), but if I computed it from scores I should get the same value, because scores is a matrix of shape (batch_size, batch_size), right?
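For what it's worth, a quick check with toy shapes (and assuming the matmul is something like tf.linalg.matmul(v1, v2, transpose_b=True)) confirms that both ways of getting the batch size agree, since scores is square:

```python
import tensorflow as tf

v1 = tf.random.normal((8, 128), dtype=tf.float64)  # made-up shapes, just for the check
v2 = tf.random.normal((8, 128), dtype=tf.float64)
scores = tf.linalg.matmul(v1, v2, transpose_b=True)  # shape (batch_size, batch_size)

# Both print 8: the number of rows of v1 equals the number of rows of scores.
print(tf.shape(v1)[0].numpy(), tf.shape(scores)[0].numpy())
```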