How does trax handle Y variable?

arvyzukai · December 28, 2022, 8:15am

Actually, the model outputs are not the V1 and V2 matrices (those are the outputs of data_generator). The model outputs are the predicted similarities between V1_1 and V2_1, V1_2 and V2_2, and so on. In other words, as a concrete example (from the # UNQ_C1 “Expected output”), the model tries its best to make input V1_1:

[  30   87   78  134 2132 1981   28   78  594   21    1    1    1    1
     1    1]

to be as similar as possible to V2_1:

[  30  156   78  134 2132 9508   21    1    1    1    1    1    1    1
     1    1]

but as different as possible from V2_2:

[  30  156   78 3541 1460  131   56  253   21    1    1    1    1    1
     1    1]
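To make the "similar vs. different" idea concrete, here is a minimal NumPy sketch. It assumes each question is encoded as a unit-length vector and that similarity is the dot product (cosine similarity) between encodings. The numbers are hypothetical stand-ins for what a trained model might produce, and `normalize` is an illustrative helper, not a trax function.

```python
import numpy as np

def normalize(x):
    """Scale each row to unit length so dot products are cosine similarities."""
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

# Hypothetical encodings (NOT real model outputs): row i of v1 and row i of v2
# stand for a duplicate question pair, e.g. V1_1 with V2_1.
v1 = normalize(np.array([[1.0, 0.2, 0.0],
                         [0.0, 1.0, 0.1]]))
v2 = normalize(np.array([[0.9, 0.3, 0.0],
                         [0.1, 0.9, 0.2]]))

# similarity[i, j] compares question i from v1 with question j from v2.
similarity = v1 @ v2.T

# The diagonal holds the duplicate pairs; off-diagonal entries are non-duplicates.
print(np.round(similarity, 3))
```

With well-trained encodings, the diagonal entries (V1_1 vs. V2_1) should be larger than the off-diagonal ones (V1_1 vs. V2_2).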

Your second question:

How does trax differentiate b/w inputs and ground truth when passing values to loss functions? For Ex, CategoricalCrossEntropy needs ground truth but the custom triplet loss does not require it.

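We designed TripletLossFn in such a way that trax can know how “good” the model is performing without a separate ground-truth input: the ground truth is implicit in how the batch is arranged, since V1_i and V2_i are duplicates and every other pairing in the batch is treated as a non-duplicate. TripletLossFn outputs a single number, where a big number means “bad” and a small number means “good”. Trax then adjusts the model weights accordingly.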

You might find this answer helpful to understand the details of TripletLossFn calculations (and how it decides if model outputs are good or not).
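As a rough illustration of the "single number" idea, here is a simplified triplet loss in NumPy. It is a sketch under stated assumptions, not the course's TripletLossFn: it uses only the closest negative per row, the name `triplet_loss_sketch` is hypothetical, and the margin of 0.25 is an assumed value. Note that it takes only the two batches of encodings and no label argument.

```python
import numpy as np

def triplet_loss_sketch(v1, v2, margin=0.25):
    """Simplified triplet loss (illustrative only).

    Assumes v1[i] and v2[i] are unit-length encodings of a duplicate pair,
    and every v2[j] with j != i is a non-duplicate of v1[i].
    Requires a batch of at least two pairs.
    """
    scores = v1 @ v2.T                        # pairwise cosine similarities
    positive = np.diag(scores)                # similarity of each true pair
    batch_size = scores.shape[0]
    # Subtract 2 on the diagonal so the true pair is never picked as a negative.
    negatives_only = scores - 2.0 * np.eye(batch_size)
    closest_negative = negatives_only.max(axis=1)
    # Penalize rows where a negative is not at least `margin` below the positive.
    per_row = np.maximum(0.0, margin - positive + closest_negative)
    return per_row.mean()

# Toy encodings: "good" matches each question with its duplicate,
# "bad" matches each question with the wrong one.
v1 = np.eye(2)
good_v2 = np.eye(2)
bad_v2 = np.eye(2)[::-1]

good_loss = triplet_loss_sketch(v1, good_v2)
bad_loss = triplet_loss_sketch(v1, bad_v2)
print(good_loss)  # 0.0
print(bad_loss)   # 1.25
```

The loss is small when duplicates score higher than non-duplicates and large otherwise, which is all the optimizer needs in order to update the weights.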

Cheers.
