When creating the fully-fine tuned model, how is this loss computed? Like how is it compared with the labels? Is it calculated using ROUGE or BLEU score or something?
It is cross-entropy loss computed during training phase have a look on this thread:
Also you can understand how this is calculated by going though the NLP specialization offered by DLAI!
1 Like