NLP with Sequence Models, C3 Week 3: Failing a unit test in the Question Duplicates assignment, Part 2 (triplet loss)

Got a wrong triplet loss for inputs:

    out: [[ 0.26726124  0.53452248  0.80178373  0.26726124  0.53452248  0.80178373]
          [ 0.5178918   0.57543534  0.63297887 -0.5178918  -0.57543534 -0.63297887]]
    margin: 0.25
    Expected: 0.7035077
    Got: 1.7499999920648641

    2 tests passed
    1 tests failed

I'm a little blanked out on this for now. I'm just posting my error as per the rules; if a mentor wants more, let me know.

1 Like

Hi @Paul_katz,

Have you tried out all the previous solutions, and nothing worked for you? Try printing your values out at each step and comparing them with this explanation. Get back here if nothing works.

Regards

3 Likes

OK, I need a little help. I get the theory; I'm just having trouble getting the code to fully cooperate.

1 Like

Hi @Paul_katz,

Up to which point are your calculations correct and where do they start to differ?

1 Like

That is where I began my bug hunt. The first discrepancy is at mask_exclude_positives: I get the correct numerical representation, but not the True/False/True/True values. I could make the latter happen, but not using the suggested TensorFlow function.

1 Like

The mask_exclude_positives is composed of two parts, as the code hint suggests:

    # create a composition of two masks:
    # the first mask to extract the diagonal elements,
    # the second mask to extract elements in the negative_zero_on_duplicate matrix
    # that are larger than the elements in the diagonal
    mask_exclude_positives = tf.cast((None)|(None),
                                     scores.dtype)

These two parts are true/false values as you can see in the image above.

To create the first mask (in place of the first None), you can make use of the instructions:

To create the mask, you need to check if the cell is on the diagonal by computing tf.eye(batch_size) == 1

To create the second mask (in place of the second None), the instructions are:

if the non-diagonal cell is greater than the diagonal, with (negative_zero_on_duplicate > tf.expand_dims(positive, 1))

These masks are compared with the | (“or”) operator between them (code already provided for you).

Finally, they are “cast” to the scores variable's data type (this is completed for you too).

In summary, you just need to replace both Nones with the code that is provided in the instructions.
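To make the composition concrete, here is a minimal runnable sketch using the assignment's variable names (scores, positive, negative_zero_on_duplicate, batch_size); the toy values are made up purely for illustration:

    import tensorflow as tf

    batch_size = 2
    # Toy similarity matrix; the diagonal holds the anchor-positive scores.
    scores = tf.constant([[0.90, 0.30],
                          [0.95, 0.80]])
    positive = tf.linalg.diag_part(scores)
    # Negative scores, with zeros on the duplicates (the diagonal).
    negative_zero_on_duplicate = scores * (1.0 - tf.eye(batch_size))

    # First mask: True on the diagonal elements.
    first_mask = tf.eye(batch_size) == 1
    # Second mask: True where a non-diagonal score exceeds its row's diagonal.
    second_mask = negative_zero_on_duplicate > tf.expand_dims(positive, 1)

    # Compose with | and cast to the scores dtype, as in the code hint.
    mask_exclude_positives = tf.cast(first_mask | second_mask, scores.dtype)
    print(mask_exclude_positives)  # [[1. 0.]
                                   #  [1. 1.]]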

Cheers

2 Likes

Feeling a little embarrassed now. I suppose I approached this out of sequence; that's an unintentional pun.

1 Like

I'm going to start with a fresh notebook again. Looking at your breakdown, it looks like my numbers are coming in inverted versus yours. Can I show my output from my print statements? Now I know the mask_exclude_positives code is right, but the output still doesn't look right to me; certainly the loss is off now and I'm not passing as many unit tests.

1 Like

Can I DM my code at this point?

1 Like

You have to be kidding me. I went down a rabbit hole over a comment error. Disregard; I'm solved. But someone should do something about the suggested axis value on mean_negative; I don't think that was helpful for getting better.

1 Like

Do you have this comment in mind?

    # use `tf.math.reduce_sum` on `negative_zero_on_duplicate` for `axis=1` and divide it by `(batch_size - 1)`

If so, what do you find confusing or not helpful about it?

Cheers

1 Like

I'm just happy I got through. Thinking it over the next day: axis=1 would not allow my assignment to pass, and I pored over some lines second-guessing all I thought I'd learned. I may have put my foot in my mouth; if so, I apologize. In this case “axis=0” means that the reduction operation is performed vertically, resulting in a single value for each column of the input tensor. Is that right? The assignment would not pass with 1 as the value, so now I'm wondering if something prior is off. If it's supposed to be 1, maybe there is a teachable moment yet to come.

1 Like

Your text is hard to read.

That is true. The axis= parameter specifies on which axis the reduction is performed. For a 2D tensor with axis=0, that means reducing “vertically”.
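For example, a quick sketch on a toy 2x2 tensor (values made up) shows the difference:

    import tensorflow as tf

    t = tf.constant([[1.0, 2.0],
                     [3.0, 4.0]])
    print(tf.math.reduce_sum(t, axis=0))  # [4. 6.] - one value per column
    print(tf.math.reduce_sum(t, axis=1))  # [3. 7.] - one value per row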

For the first step: do you get the exact values in their places as in the picture?
Note that v2 is not transposed, while v1 is.
In other words, if you use tf.linalg.matmul() with the parameter transpose_b=True, then the order matters: it's v2, v1 and not v1, v2.
In one case the results are as in the picture, while in the other case the result would be “flipped” (the non-diagonal elements would be switched).
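Here is a small sketch of that order-of-arguments point; the v1/v2 values below are toy stand-ins, not the assignment's data:

    import tensorflow as tf

    # Toy L2-normalized row vectors standing in for v1 and v2.
    v1 = tf.constant([[1.0, 0.0],
                      [0.0, 1.0]])
    v2 = tf.constant([[ 0.8, 0.6],
                      [-0.6, 0.8]])

    scores = tf.linalg.matmul(v2, v1, transpose_b=True)   # v2 @ v1^T
    flipped = tf.linalg.matmul(v1, v2, transpose_b=True)  # v1 @ v2^T

    print(scores)   # [[ 0.8  0.6]  [-0.6  0.8]]
    print(flipped)  # [[ 0.8 -0.6]  [ 0.6  0.8]] - off-diagonals switched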

1 Like

Well, I finally got it. OMG, I'm usually trepidatious about even coming to the forum for help, but now I'm sure the 3 days weren't wasted. So simple in hindsight. Thanks Arvyzukai. Wow, I'm blown away by how long it took me; all the flags were saying it was “flipped”.

2 Likes